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PREFACE. 


The elementary geometry, which is learned in school, deals 
with basic concepts such as a point, a straight line, a segment. 
They are used to compose more complicated concepts: a polygo- 
nal line, a polygon, a polyhedron. Some curvilinear forms are also 
considered: a circle, a cylinder, a cone, a sphere, a ball. 

The analytical geometry basically deals with the same geomet- 
ric objects as the elementary geometry does. The difference is in 
a method for studying these objects. The elementary geometry 
relies on visual impressions and formulate the properties of geo- 
metric objects in its axioms. From these axioms various theorems 
are derived, whose statements in most cases are also revealed in 
visual impressions. The analytical geometry is more inclined to a 
numeric description of geometric objects and their properties. 

The transition from a geometric description to a numeric de- 
scription becomes possible due to coordinate systems. Each 
coordinate system associate some groups of numbers with geo- 
metric points, these groups of numbers are called coordinates of 
points. The idea of using coordinates in geometry belongs French 
mathematician Rene Descartes. Simplest coordinate systems 
suggested by him at present time are called Cartesian coordinate 
systems. 

The construction of Cartesian coordinates particularly and 
the analytical geometry in general are based on the concept of 
a vector. The branch of analytical geometry studying vectors is 
called the vector algebra. The vector algebra constitutes the first 
chapter of this book. The second chapter explains the theory of 
straight lines and planes and the theory of curves of the second 
order. In the third chapter the theory of surfaces of the second 
order is explained in brief. 

The book is based on lectures given by the author during 
several years in Bashkir State University. It was planned as the 
first book in a series of three books. However, it happens that 
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8 PREFACE. 


the second and the third books in this series were written and 
published before the first book. These are 


— «Course of linear algebra and multidimensional geometry» [1]; 
— «Course of differential geometry» [2]. 


Along with the above books, the following books were written: 


— «Representations of finite group» [3]; 

— «Classical electrodynamics and theory of relativity» [4]; 

— «Quick introduction to tensor analysis» [5]. 

— «Foundations of geometry for university students and high 
school students» [6]. 


The book [3] can be considered as a continuation of the book 
[1] which illustrates the application of linear algebra to another 
branch of mathematics, namely to the theory of groups. The 
book [4] can be considered as a continuation of the book [2]. It 
illustrates the application of differential geometry to physics. The 
book [5] is a brief version of the book [2]. As for the book [6], by 
its subject it should precede this book. It could br recommended 
to the reader for deeper logical understanding of the elementary 
geometry. 

I am grateful to Prof. R. R. Gadylshin and Prof. D. I. Borisov 
for reading and refereeing the manuscript of this book and for 
valuable advices. I am grateful to Mr. Eric Larouche who has 
read the book and provided me the errata list for the first English 
edition of this book. 


June, 2013. R. A. Sharipov. 


CHAPTER I 


VECTOR ALGEBRA. 


§1. Three-dimensional Euclidean space. 
Acsiomatics and visual evidence. 


Like the elementary geometry explained in the book [6], 
the analytical geometry in this book is a geometry of three- 
dimensional space E. We use the symbol E for to denote the 
space that we observe in our everyday life. Despite being seem- 
ingly simple, even the empty space E possesses a rich variety 
of properties. These properties reveal through the properties of 
various geometric forms which are comprised in this space or 
potentially can be comprised in it. 

Systematic study of the geometric forms in the space E was 
initiated by ancient Greek mathematicians. It was Euclid who 
succeeded the most in this. He has formulated the basic prop- 
erties of the space E in five postulates, from which he derived 
all other properties of E. At the present time his postulates 
are called axioms. On the basis of modern requirements to the 
rigor of mathematical reasoning the list of Euclid’s axioms was 
enlarged from five to twenty. These twenty axioms can be found 
in the book [6]. In favor of Euclid the space that we observe in 
our everyday life is denoted by the symbol E and is called the 
three-dimensional Euclidean space. 

The three-dimensional Euclidean point space E consists of 
points. All geometric forms in it also consist of points. subsets 


— 


of the space E. Among subsets of the space E straight lines 
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and planes (see Fig. 1.2) play an especial role. They are used 
in the statements of the first eleven Euclid’s axioms. On the 
base of these axioms the concept of a segment (see Fig. 1.2) is 
introduced. The concept of a segment is used in the statement of 
the twelfth axiom. 

The first twelve of Euclid’s axioms appear to be sufficient 
to define the concept of a ray and the concept of an angle 
between two rays outgoing from the same point. The concepts 


OO 


BN 


Fig. 1.1 Fig. 1.2 


of a segment and an angle along with the concepts of a straight 
line and a plane appear to be sufficient in order to formulate 
the remaining eight Euclid’s axioms and to build the elementary 
geometry in whole. 

Even the above survey of the book [6], which is very short, 
shows that building the elementary geometry in an axiomatic way 
on the basis of Euclid’s axioms is a time-consuming and laborious 
work. However, the reader who is familiar with the elementary 
geometry from his school curriculum easily notes that proof of 
theorems in school textbooks are more simple than those in [6]. 
The matter is that proofs in school textbooks are not proofs in 
the strict mathematical sense. Analyzing them carefully, one can 
find in these proofs the usage of some non-proved propositions 
which are visually obvious from a supplied drawing since we have 
a rich experience of living within the space E. Such proofs can 
be transformed to strict mathematical proofs by filling omissions, 
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i.e. by proving visually obvious propositions used in them. 

Unlike [6], in this book I do not load the reader by absolutely 
strict proofs of geometric results. For geometric definitions, 
constructions, and theorems the strictness level is used which is 
close to that of school textbooks and which admits drawings and 
visually obvious facts as arguments. Whenever it is possible I 
refer the reader to strict statements and proofs in [6]. As far as 
the analytical content is concerned, i.e. in equations, in formulas, 
and in calculations the strictness level is applied which is habitual 
in mathematics without any deviations. 


§2. Geometric vectors. Vectors bound to points. 


DEFINITION 2.1. A geometric vectors AB’ is a straight line 
segment in which the direction from the point A to the point B 
is specified. The point A is called the initial point of the vector 
AB, while the point B is called its terminal point. 


The direction of the vector AB’ in drawing is marked by an 
arrow (see Fig. 2.1). For this reason vectors sometimes are called 
directed segments. 
ae: Each segment [AB] is associ- 
ated with two different vectors: 
A re and Be . The vector BA is 
usually called the opposite vector 
Fig. 2.1 for the vector AB. 

Note that the arrow sign on 
the vector AB’ and bold dots at the ends of the segment [AB] 
are merely symbolic signs used to make the drawing more clear. 
When considered as sets of points the vector AB and the segment 
[AB] do coincide. 

A direction on a segment, which makes it a vector, can mean 
different things in different situations. For instance, drawing a 
vector AB on a geographic map, we usually mark the displace- 
ment of some object from the point A to the point B. However, 
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—> 
if it is a weather map, the same vector AB can mean the wind 
direction and its speed at the point A. In the first case the 
length of the vector AB is proportional to the distance between 
the points A and B. In the second case the length of AB is 
proportional to the wind speed at the point A. 

There is one more difference in the above two examples. In 
the first case the vector AB is bound to the points A and B by 
its meaning. In the second case the vector AB is bound to the 
point A only. The fact that its arrowhead end is at the point B is 
a pure coincidence depending on the scale we used for translating 
the wind speed into the length units on the map. According to 
what was said, geometric vectors are subdivided into two types: 


1) purely geometric; 
2) conditionally geometric. 


Only displacement vectors belong to the first type; they actu- 
ally bind some two points of the space E. The lengths of these 
vectors are always measured in length units: centimeters, meters, 
inches, feets etc. 

Vectors of the second type are more various. These are velocity 
vectors, acceleration vectors, and force vectors in mechanics; 
intensity vectors for electric and magnetic fields, magnetization 
vectors in magnetic materials and media, temperature gradients 
in non-homogeneously heated objects et al. Vectors of the second 
type have a geometric direction and they are bound to some 
point of the space E, but they have not a geometric length. 
Their lengths can be translated to geometric lengths only upon 
choosing some scaling factor. 

Zero vectors or null vectors hold a special position among 
geometric vectors. They are defined as follows. 


DEFINITION 2.2. A geometric vector of the space E whose 
initial and terminal points do coincide with each other is called a 
zero vector or a null vector. 


A geometric null vector can be either a purely geometric vector 


§3. EQUALITY OF VECTORS. 13 
or a conditionally geometric vector depending on its nature. 
§ 3. Equality of vectors. 


DEFINITION 3.1. Two geometric vectors AB and CD are 
called equal if they are equal in length and if they are codirected, 
i.e. |AB| = |C D| and AB f CD. 


The vectors AB and CD are said to be codirected if they lie 
on a same line as shown in Fig. 3.1 of if they lie on parallel lines 
as shown in Fig. 3.2. In both cases they should be pointing in the 


D 
C C 
p ee 
Fig. 3.1 Fig. 3.2 


same direction. Codirectedness of geometric vectors and their 
equality are that very visually obvious properties which require 
substantial efforts in order to derive them from Euclid’s axioms 
(see [6]). Here I urge the reader not to focus on the lack of rigor 
in statements, but believe his own geometric intuition. 

Zero geometric vectors constitute a special case since they do 
not fix any direction in the space. 


DEFINITION 3.2. All null vectors are assumed to be codirected 
to each other and each null vector is assumed to be codirected to 
any nonzero vector. 


The length of all null vectors is zero. However, depending on 
the physical nature of a vector, its zero length is complemented 
with a measure unit. In the case of zero force it is zero newtons, 
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in the case of zero velocity it is zero meters per second. For this 
reason, testing the equality of any two zero vectors, one should 
take into account their physical nature. 


DEFINITION 3.3. All null vectors of the same physical nature 
are assumed to be equal to each other and any nonzero vector is 
assumed to be not equal to any null vector. 


Testing the equality of nonzero vectors by means of the defi- 
nition 3.1, one should take into account its physical nature. The 
equality |AB| = |CD| in this definition assumes not only the 
equality of numeric values of the lengths of AB’ and CD , but 
assumes the coincidence of their measure units as well. 

A remark. Vectors are geometric forms, i.e. they are sets 
of points in the space E. However, the equality of two vectors 
introduced in the definition 3.1 differs from the equality of sets. 


§4. The concept of a free vector. 


Defining the equality of vectors, it is convenient to use parallel 
translations. Each parallel translation is a special transformation 
of the space p: E — E under which any straight line is mapped 
onto itself or onto a parallel line and any plane is mapped onto 
itself or onto a parallel plane. When applied to vectors, parallel 
translation preserve their length and their direction, i.e. they 
map each vector onto a vector equal to it, but usually being in a 
different place in the space. The number of parallel translations 
is infinite. As appears, the parallel translations are so numerous 
that they can be used for testing the equality of vectors. 


DEFINITION 4.1. A geometric vector CD is called equal to a 
geometric vector AB if there is a parallel translation p: E> E 
that maps the vector AB’ onto the vector CD’ i.e. such that 
p(A) =C and p(B) = D. 


The definition 4.1 is equivalent to the definition 3.1. I do 
not prove this equivalence, relying on its visual evidence and 
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assuming the reader to be familiar with parallel translations from 
the school curriculum. A more meticulous reader can see the 
theorems 8.4 and 9.1 in Chapter VI of the book [6]. 


THEOREM 4.1. For any two points A and C in the space E 
there is exactly one parallel translation p: E —> E mapping the 
point A onto the point C, i.e. such that p(A) = C. 


The theorem 4.1 is a visually obvious fact. On the other hand 
it coincides with the theorem 9.3 from Chapter VI in the book 
[6], where it is proved. For these two reasons we exploit the 
theorem 4.1 without proving it in this book. 

Lei’s apply the theorem 4.1 to some geometric vector AB’ . Let 
C be an arbitrary point of the space E and let p be a parallel 
translation taking the point A to the point C. The existence 
and uniqueness of such a parallel translation are asserted by the 
theorem 4.1. Let’s define the point D by means of the formula 
D = p(B). Then, according to the definition 4.1, we have 


AB’ = CD. 


—> 
These considerations show that each geometric vector AB has a 
copy equal to it and attached to an arbitrary point C € E. In 
the other words, by means of parallel translations each geometric 


vector AB can be replicated up to an infinite set of vectors equal 
to each other and attached to all points of the space E. 


DEFINITION 4.2. A free vector is an infinite collection of geo- 
metric vectors which are equal to each other and whose initial 
points are at all points of the space E. Each geometric vector in 
this infinite collection is called a geometric realization of a given 
free vector. 


Free vectors can be composed of purely geometric vectors or 
of conditionally geometric vectors as well. For this reason one 
can consider free vectors of various physical nature. 
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In drawings free vectors are usually presented by a single geo- 
metric realization or by several geometric realizations if needed. 

Geometric vectors are usually denoted by two capital letters: 
AB, CD, EF etc. Free vectors are denoted by single lowercase 
letters: a, b, C etc. Arrows over these letters are often omitted 
since it is usually clear from the context that vectors are con- 
sidered. Below in this book I will not use arrows in denoting 
free vectors. However, I will use boldface letters for them. In 
many other books, but not in my book [1], this restriction is also 
removed. 


§5. Vector addition. 


Assume that two free vectors a and b are given. Let’s choose 
some arbitrary point A and consider the geometric realization of 
the vector a with the initial point A. Let’s denote through B 
the terminal point of this geometric realization. As a result we 
get a = AB. Then we consider the geometric realization of the 
vector b with initial point B and denote through C its terminal 
point. This yields b = BC. 


Pea f 
DEFINITION 5.1. The geometric vector AC connecting the 
initial point of the vector AB with the terminal point of the 
2, oo a3 
vector BC is called the sum of the vectors AB and BC: 


AC’ = AB’ + BC. (5.1) 


The vector AC constructed by means of the vectors a = AB’ 
and b = BC can be replicated up to a free vector c by parallel 
translations to all points of the space E. Such a vector c is 
naturally called the sum of the free vectors a and b. For this 
vector we write c = a+b. The correctness of such a definition is 
guaranteed by the following lemma. 


LEMMA 5.1. The sum c = a+b = AC of two free vectors 
a = AB and b = BC expressed by the formula (5.1) does not 
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depend on the choice of a point A at which the geometric realiza- 
tion AB of the vector a begins. 


PROOF. In addition to A, let’s choose another initial point FE. 
Then in the above construction of the sum a+b the vector a has 
F 


B 


Fig. 5.1 


—> ——> 
two geometric realizations AB and EF’. The vector b also has 
two geometric realizations BC’ and FG (see Fig. 5.1). Then 


AB = EF, BC = FG. (5.2) 


Instead of (5.1) now we have two equalities 


AC = AB + BC, EG = EF +FG. (5.3) 


Let’s denote through p a parallel translation that maps the 
point A to the point E, i.e. such that p(A) = E. Due to the 
theorem 4.1 such a parallel translation does exist and it is unique. 
From p(A) = E and from the first equality (5.2), applying the 
definition 4.1, we derive p(B) = F. Then from p(B) = F and 
from the second equality (5.2), again applying the definition 4.1, 
we get p(C) = G. As a result we have 
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The relationships (5.4) mean that the parallel translation p maps 


the vector AC. to the vector EG’. Due to the definition 4.1 this 
fact yields AC = EG. Now from the equalities (5.3) we derive 


AB’ + BC = EF +FG. (5.5) 


The equalities (5.5) complete the proof of the lemma 5.1. 


The addition rule given by the formula (5.1) is called the 
triangle rule. It is associated with the triangle ABC in Fig. 5.1. 


86. Multiplication of a vector by a number. 


Let a be some free vector. Let’s choose some arbitrary point 
A and consider the geometric realization of the vector a with 
initial point A. Then we denote through B the terminal point of 
this geometric realization of a. Let œ be some number. It can be 


5 C A B A 5 
C 
Fig. 6.1 Fig. 6.2 Fig. 6.3 


either positive, negative, or zero. 
Let a > 0. In this case we lay a point C onto the line AB so 
that the following conditions are fulfilled: 


AC’ tt AB’, |AC] = al - |ABI. (6.1) 


As a result we obtain the drawing which is shown in Fig. 6.1. 

If a = 0, we lay the point C so that it coincides with the point 
A. In this case the vector AC’ appears to be zero as shown in 
Fig. 6.2 and we have the relationship 


|AC| = |a| - | ABI. (6.2) 
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In the case œ < 0 we lay the point C onto the line AB so that 
the following two conditions are fulfilled: 


ACHAB, |AC] = Jal - |AB]. (6.3) 


This arrangement of points is shown in Fig. 6.3. 


DEFINITION 6.1. In each of the three cases a > 0, a = 0, and 
a < 0 the geometric vector AC defined through the vector AB 
according to the drawings in Fig. 6.1, in Fig. 6.2, and in Fig. 6.3 
and according to the formulas (6.1), (6.2), and (6.3) is called 
the product of the vector AB by the number a. This fact is 
expressed by the following formula: 


B. 


AC = aA (6.4) 


The case a = O is not covered by the above drawings in 
Fig. 6.1, in Fig. 6.2, and in Fig. 6.3. In this case the point B 
coincides with the points A and we have |AB| = 0. In order 
to provide the equality |AC| = |a| - |AB| the point C is chosen 
coinciding with the point A. Therefore the product of a null 
vector by an arbitrary number is again a null vector. 

The geometric vector AC’ constructed with the use of the 
vector a= AB and the number a can be replicated up to a free 
vector c by means of the parallel translations to all points of the 
space E. Such a free vector c is called the product of the free 
vector a by the number a. For this vector we write c = qa. a. 
The correctness of this definition of c = a-a is guaranteed by the 
following lemma. 


LEMMA 6.1. The product c = a:a= AC’ of a free vector 
a = AB by a number a expressed by the formula (6.4) does 
not depend on the choice of a point A at which the geometric 
realization of the vector a is built. 
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PROOF. Let’s prove the lemma for the case a 4 0 and a > 0. 
In addition to A we choose another initial point Æ. Then in the 


Fig. 6.4 


construction of the product a -a the vector a gets two geometric 
realizations AB and EF (see Fig. 6.4). Hence we have 


AB = EF. (6.5) 


Let’s denote through p the parallel translation that maps 
the point A to the point E, i.e. such that p(A) = E. Then 
from the equality (6.5), applying the definition 4.1, we derive 
p(B) = F. The point C is placed on the line AB at the distance 
|AC| = |a| - |AB| from the point A in the direction of the vector 
AB’ Similarly, the point G is placed on the line EF at thew 
distance |EG| = |a| - |EF| from the point E in the direction of 
the vector EF’. From the equality (6.5) we derive |AB| = |EF'. 
Therefore |AC| = |a|-|AB| and |EG| = |a| - |EF| mean that 
|AC| = |EF|. Due to p(A) = E and p(B) = F the parallel 
translation p maps the line AB onto the line EF. It preserves 
lengths of segments and maps codirected vectors to codirected 


ones. Hence p(C) = G. Along with pi = E due to the 
definition 4.1 the equality p(C) = G yields AC = EG, i.e. 


a- AB =a- EF. 


The lemma 6.1 is proved for the case a Æ 0 and a > 0. Its proof 
for the other cases is left to the reader as an exercise. 


EXERCISE 6.1. Consider the cases a = 0 and a < 0 fora # 0 
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and consider the case a = 0. Prove the lemma 6.1 for these cases 
and provide your proof with drawings analogous to that of Fig 6.4. 


§7. Properties of the algebraic operations with vectors. 


The addition of vectors and their multiplication by numbers 
are two basic algebraic operations with vectors in the three- 
dimensional Euclidean point space E. Eight basic properties of 
these two algebraic operations with vectors are usually consid- 
ered. The first four of these eight properties characterize the 
operation of addition. The other four characterize the operation 
of multiplication of vectors by numbers and its correlation with 
the operation of vector addition. 


THEOREM 7.1. The operation of addition of free vectors and 
the operation of their multiplication by numbers possess the fol- 
lowing properties: 


1) commutativity of addition: a+ b = b + a; 

) associativity of addition: (a+b)+c=a+(b+c); 
3) the feature of the null vector: a+ 0 =a; 

) the existence of an opposite vector: for any vector a there is 
an opposite vector a’ such that a+ a’ = 0; 
5) distributivity of multiplication over the addition of vectors: 

k-(a+b)=k-a+k-b; 

6) distributivity of multiplication over the addition of numbers: 
(kK+q)-a=k-a+q-a; 
7) associativity of multiplication by numbers: (k q)-a = k- (q-a); 
) the feature of the numeric unity: 1-a=a. 


Let’s consider the properties listed in the theorem 7.1 one 
by one. Let’s begin with the commutativity of addition. The 
sum a+b in the left hand side of the equality a +b = b +a 
is calculated by means of the triangle rule upon choosing some 
geometric realizations a= AB and b = BC as shown in Fig. 7.1. 

Let’s draw the line parallel to the line BC and passing 
through the point A. Then we draw the line parallel to the 
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line AB and passing through the 
point C. Both of these lines 
are in the plane of the triangle 
ABC. For this reason they in- 
tersect at some point D. The 
segments [AB], [BC], [CD], and 
[DA] form a parallelogram. 
—> 
l Let’s | mark the vectors DC 
Fig. 7.1 and AD on the segments [CD] 
and [DA]. It is easy to see that the vector DC is produced from 
the vector AB by applying the parallel translation from the point 
A to the point D. Similarly the vector AD is produced from the 
vector BC by applying the parallel translation from the point B 
to the point A. Therefore DC = AB = a and BC = AD =b. 
Now the triangles ABC and ADC yield 


AC’ = AB’+ BO =a+b, 
AC = AD +D =b+a. 


(7.1) 


From (7.1) we derive the required equality a+ b = b + a. 

The relationship a+ b = b+a and Fig. 7.1 yield another 
method for adding vectors. It is called the parallelogram rule. In 
order to add two vectors a and b their geometric realizations AB 
and AD’ with the common initial point A are used. They are 
completed up to the parallelogram ABCD. Then the diagonal 
of this parallelogram is taken for the geometric realization of the 
sum: a+ b= AB +: AD’ = AC’ 


EXERCISE 7.1. Prove the equality a+ b = b + a for the case 
where a || b. For this purpose consider the subcases 


1) aff b; 2) ab and |al| > |bj; 
3) atlb and |a|=|bl; 4) atlb and |al < |b]. 
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The next property in the the- 
orem 7.1 is the associativity of 
the operation of vector addition. 
In order to prove this property 
we choose some arbitrary initial 
point A and construct the follow- 
ing geometric realizations of the 
vectors: a= AB, b= BC. and 

Fig. 7.2 c= CD’. Applying the trian- 
gle rule of vector addition to the 
triangles ABC and ACD, we get the relationships 


atban + BC = AC. 


> > > (7.2) 
(a+b)+c= AC +CD = AD 


(see Fig. 7.2). Applying the same rule to the triangles BCD and 
ABD, we get the analogous relationships 


b4+c=BC+CD =PD, 


(7.3) 
a+(b+c)=AB+BD=AD. 


The required relationship (a+b) + c = a + (b + c) now is 
immediate from the formulas (7.2) and (7.3). 

A remark. The tetragon ABCD in Fig. 7.2 is not necessarily 
planar. For this reason the line CD is shown as if it goes under 
the line AB, while the line BD is shown going over the line AC. 

The feature of the null vector a+ 0 = a is immediate from the 
triangle rule for vector addition. Indeed, if an initial point A for 
the vector a is chosen and if its geometric realization AB is built, 
then the null vector O is presented by its geometric realization 
BB. From the definition 5.1 we derive AB + BB = AB which 
yields a + 0 = a. 

The existence of an opposite vector is also easy to prove. 
Assume that the vector a is presented by its geometric realization 


24 CHAPTER I. VECTOR ALGEBRA. 


—> => 
AB. Let’s consider the opposite geometric vector BA and let’s 
denote through a’ the corresponding free vector. Then 


a+a' = AB + BA = AA = 0. 


The distributivity of multiplication over the vector addition 
follows from the properties of the homothety transformation in 
the Euclidean space E (see §11 of Chapter VI in [6]). It is 
sometimes called the similarity transformation, which is not quite 
exact. Similarity transformations constitute a larger class of 
transformations that comprises homothety transformations as a 
subclass within it. 

Let a Y b and let the sum of vectors a + b is calculated 
according to the triangle rule as shown in Fig. 7.3. Assume that 


Fig. 7.3 Fig. 7.4 


k > 0. Let’s construct the homothety transformation hga: E > E 
with the center at the point A and with the homothety factor 
k. Let’s denote through E the image of the point B under the 
transformation hka and let’s denote through F the image of the 
point C under this transformation: 


E =hx,(B), F = hpa (C). 


Due to the properties of the homothety the line EF is parallel to 
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the line BC and we have the following relationships: 


EFN BC, |EF| = |k| - |BC], 
AE Tt AB, |AE| = |k| - |ABI, (7.4) 
AF Tt AC, | AF| = |k| - |AC]. 


Comparing (7.4) with (6.1) and taking into account that we 
consider the case k > 0, from (7.4) we derive 


AE =k-AB, EF=k-BC, AF=k-AC. (75) 


The relationships (7.5) are sufficient for to prove the distributiv- 
ity of the mutiplication of vectors by numbers over the operation 
of vector addition. Indeed, from (7.5) we obtain: 


k-(a+b) =k: (AB + BC) =k- AC =AF = 


> > > > (7.6) 
= AE +EF =k. AB +k-BC =k-at+k-b. 


The case where a }{ b and k < 0 is very similar to the case just 
above. In this case Fig. 7.3 is replaced by Fig. 7.4. Instead of the 
relationships (7.4) we have the relationships 


EF BC, |EF| =|k| -|BCI, 
AE AB’ |AE| = |k| - |AB|, (7.7) 
AFN AC, |AF]| = |k| - [AC]. 


Taking into account k < 0 from (7.7) we derive (7.5) and (7.6). 
In the case k = 0 the relationship k- (a+b) =k-a+k-b 
reduces to the equality 0 =0+0. This equality is trivially valid. 
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EXERCISE 7.2. Prove that k- (a+b) =k-a+k-b for the case 
a || b. For this purpose consider the subcases 


1) aff b; 2) ab and |al| > |b; 
3) ab and |a| = |b]; 4) at}b and |a| < |b]. 


In each of these subcases consider two options: k > 0 and k < 0. 


Let’s proceed to proving the distributivity of multiplication of 
vectors by numbers over the addition of numbers. The case a = 0 
in the equality (k +q)-a=k-a+q-ais trivial. In this case the 
equality (k +q)-a=k-a+q-a reduces to 0 = 0 + 0. 

The cases k = 0 and q = 0 are also trivial. In these cases 
the equality (k + q)-a = k-a+q-a reduces to the equalities 
q:a=0+q-aandk-a=k-a+0O respectively. 

Let’s consider the case a # O and for the sake of certainty 
let’s assume that k > 0 and q > 0. Let’s choose some arbi- 

pa trary point A and let’s build the 

ad: geometric realizations of the vec- 

C tor k -a with the initial point A. 

A B Let B be the terminal point of 

this geometric realization. Then 

AB = k.a. Similarly, we con- 

P struct the geometric realization 

BC = q-a. Due to k > 0 and q > 0 the vectors AB and BC 

are codirected to the vector a. These vectors lie on the line AB 
(see Fig. 7.5). The sum of these two vectors 


Fig. 7.5 


AC = AB + BC (7.8) 


lie on the same line and it is codirected to the vector a. The 
length of the vector AC is given by the formula 


|AC| = |AB| + |BC| = k |a| + q |a| = (k + q) |al. (7.9) 
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Due to AC Tl a and due to k + q > 0 from (7.9) we derive 


AC’ = (k+q)-a. (7.10) 


Let’s substitute (7.10) and (7.8) and take into account the re- 
lationships AB = k.a and BC = q-a which follow from 


our constructions. As a result we get the required equality 
(k+q)-a=k-at+q-a. 


EXERCISE 7.3. Prove that (k+q):a=k-a+q-a for the case 
where a £ 0, while k and q are two numbers of mutually opposite 
signs. For the case consider the subcases 


1) |k] > lal; 2) [k| = lal; 3) |k] < Iq. 


The associativity of the multiplication of vectors by numbers 
is expressed by the equality (kq)-a=k-(q-a). If a= 0, this 
equality is trivially fulfilled. It reduces to 0 = 0. If k = 0 or if 
q = 0, it is also trivial. In this case it reduces to 0 = 0. 

Let’s consider the case a 4 0, k > 0, and q > 0. Let’s choose 
some arbitrary point A in the space E and build the geometric 

bia realization of the vector q-a with 

the initial point A. Let B be the 

C terminal point of this geometric 
B realization. Then AB = q-a 
(see Fig. 7.6). Due to q > 0 the 

Fig. 7.6 vector AB is codirected with the 
5 5 vector a. Let’s build the vector 
AC as the product AC = k. AB = k- (q-a) relying upon the 
definition 6.1. Due to k > 0 the vector AC is also codirected 


with a. The lengths of AB’ and AC are given by the formulas 


|AB| = qlal, |AC| = k |AB|. (7.11) 
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From the relationships (7.11) we derive the equality 
|AC| = k (lal) = (kq) jal. (7.12) 


The equality (7.12) combined with AC’ jj a and kq > 0 yields 
AC’ = (kq)-a. By our construction AC 2 AB ae: (q-a). 
As a result now we immediately derive the required equality 
(kq)-a= k- (q-a). 


EXERCISE 7.4. Prove the equality (kq) -a = k- (q-a) in the 
case where a Æ 0, while the numbers k and q are of opposite signs. 
For this case consider the following two subcases: 


1) k>0 and q< 0; 2) k<0 and q>0. 


The last item 8 in the theorem 7.1 is trivial. It is immediate 
from the definition 6.1. 


§ 8. Vectorial expressions and their transformations. 


The properties of the algebraic operations with vectors listed 
in the theorem 7.1 are used in transforming vectorial expressions. 
Saying a vectorial expression one usually assumes a formula such 
that it yields a vector upon performing calculations according 
to this formula. In this section we consider some examples of 
vectorial expressions and learn some methods of transforming 
these expressions. 

Assume that a list of several vectors a, ... , an is given. Then 
one can write their sum setting brackets in various ways: 


(ay a2) (a3 H (a4 H s+ (Anat bap) cx), 


ee nee ee ee Oe eee 


There are many ways of setting brackets. The formulas (8.1) 
show only two of them. However, despite the abundance of the 
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ways for brackets setting, due to the associativity of the vector 
addition (see item 2 in the theorem 7.1) all of the expressions 
like (8.1) yield the same result. For this reason the sum of the 
vectors aj, ... , a, can be written without brackets at all: 


aj + a2 + a3 + a4 +... +an-1ı +a. (8.2) 


In order to make the formula (8.2) more concise the summation 
sign is used. Then the formula (8.2) looks like 


ai +a +... +an =) a. (8.3) 
i=l 


The variable 7 in the formula (8.3) plays the role of the cycling 
variable in the summation cycle. It is called a summation indez. 
This variable takes all integer values ranging from i= 1 toi =n. 
The sum (8.3) itself does not depend on the variable i. The 
symbol i in the formula (8.3) can be replaced by any other 
symbol, e.g. by the symbol j or by the symbol k: 


aes a =f ae (8.4) 
i=1 j=l k=1 


The trick with changing (redesignating) a summation index used 
in (8.4) is often applied for transforming expressions with sums. 
The commutativity of the vector addition (see item 1 in the 
theorem 7.1) means that we can change the order of summands 
in sums of vectors. For instance, in the sum (8.2) we can write 


aj taz +... tan = an + an-ı +... +a. 
The most often application for the commutativity of the vector 


addition is changing the summation order in multiple sums. 
Assume that a collection of vectors aj; is given which is indexed 
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by two indices i and j, where i = 1,...,m and j =1,...,n. 
Then we have the equality 


i=1 j=1 j=l i=l 


that follows from the commutativity of the vector addition. In 
the same time we have the equality 


i=1 j=1 j=l i=l 
which is obtained by redesignating indices. Both methods of 
transforming multiple sums (8.5) and (8.6) are used in dealing 
with vectors. 
The third item in the theorem 7.1 describes the property of 
the null vector. This property is often used in calculations. If the 
sum of a part of summands in (8.3) is zero, e.g. if the equality 


n 
ak+ı +... + an = > a; =0 
i=k+1 


is fulfilled, then the sum (8.3) can be transformed as follows: 


n k 
art...tan= a;= 5 aj=a,+...+ ag. 
i=l i=1 


The fourth item in the theorem 7.1 declares the existence of 
an opposite vector a’ for each vector a. Due to this item we can 
define the subtraction of vectors. 


DEFINITION 8.1. The difference of two vectors a — b is the 
sum of the vector a with the vector b’ opposite to the vector b. 
This fact is written as the equality 


a—b=a+b’. (8.7) 
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EXERCISE 8.1. Using the definitions 6.1 and 8.1, show that the 
opposite vector a’ is produced from the vector a by multiplying 
it by the number —1, i.e. 


a’ =(-l)-a. (8.8) 
Due to (8.8) the vector a! opposite to a is denoted through —a 
and we write a' = —a = (—1)-a. 


The distributivity properties of the multiplication of vectors 
by numbers (see items 5 and 6 in the theorem 7.1) are used for 
expanding expressions and for collecting similar terms in them: 


Ene 9 


i=l i=1 
(Soa)-a= Sasa (8.10) 
i=1 i=1 


Transformations like (8.9) and (8.10) can be found in various 
calculations with vectors. 


EXERCISE 8.2. Using the relationships (8.7) and (8.8), prove 
the following properties of the operation of subtraction: 
a— a = 0; (a+b) — 
(a—b)+c=a-(b-—-c); (a-b) 


a:-(a—b)=a-a-a-b; (a-—f)-a=a-a-B-a. 


Here a, b, and c are vectors, while œ and 8 are numbers. 


The associativity of the multiplication of vectors by numbers 
(see item 7 in the theorem 7.1) is used expanding vectorial 
expressions. Here is an example of such usage: 
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In multiple sums this property is combined with the commuta- 
tivity of the regular multiplication of numbers by numbers: 


So ai (Zs ; ay) = 8 (Soa ay): 
i=1 j=1 j=1 i=1 


A remark. It is important to note that the associativity of 
the multiplication of vectors by numbers is that very property 
because of which one can omit the dot sign in writing a product 
of a number and a vector: 


qœ&qa=Q:a. 


Below I use both forms of writing for products of vectors by 
numbers intending to more clarity, conciseness, and aesthetic 
beauty of formulas. 

The last item 8 of the theorem 7.1 expresses the property 
of the numeric unity in the form of the relationship 1-a = a. 
This property is used in collecting similar terms and in finding 
common factors. Let’s consider an example: 


a+3-b+2-a+b=ai2-a+3-b+b=1-a+2-a+ 
+3-b+1-b=(14+2)-a+(34+1)-b=3-a+4-b. 


EXERCISE 8.3. Using the relationship 1-a = a, prove that the 
conditions a -a = 0 and a Æ 0 imply a = 0. 


§9. Linear combinations. Triviality, 
non-triviality, and vanishing. 


Assume that some set of n free vectors aj,..., an is given. 
One can call it a collection of n vectors, a system of n vectors, or 
a family of n vectors either. 

Using the operation of vector addition and the operation of 
multiplication of vectors by numbers, one can compose some 
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vectorial expression of the vectors ai, ... , an. It is quite likely 
that this expression will comprise sums of vectors taken with 
some numeric coefficients. 


DEFINITION 9.1. An expression of the form qı a, +... + Qn an 
composed of the vectors aj, ..., an is called a linear combina- 
tion of these vectors. The numbers aj,...,Q, are called the 
coefficients of a linear combination. If 


Qai aı +... + Qn apn = b, (9.1) 


then the vector b is called the value of a linear combination. 


In complicated vectorial expressions linear combinations of the 
vectors ai, ..., an can be multiplied by numbers and can be 
added to other linear combinations which can also be multiplied 
by some numbers. Then these sums can be multiplied by numbers 
and again can be added to other subexpressions of the same 
sort. This process can be repeated several times. However, 
upon expanding, upon applying the formula (8.11), and upon 
collecting similar terms with the use of the formula (8.10) all such 
complicated vectorial expressions reduce to linear combinations 
of the vectors aj, ... , an. Let’s formulate this fact as a theorem. 


THEOREM 9.1. Each vectorial expression composed of vectors 
aj, ... , an by means of the operations of addition and multiplica- 
tion by numbers can be transformed to some linear combination 
of these vectors a1, ... , an- 


The value of a linear combination does not depend on the order 
of summands in it. For this reason linear combinations differing 
only in order of summands are assumed to be coinciding. For 
example, the expressions a1 aı +. ..+4@n an and Qn an +.. .+Q1 a 
are assumed to define the same linear combination. 


DEFINITION 9.2. A linear combination a,;a;, +... + Qn an 
composed of the vectors ai, ... , an is called trivial if all of its 
coefficients are zero, i.e. if a1 =... =a, = 0. 


34 CHAPTER I. VECTOR ALGEBRA. 


DEFINITION 9.3. A linear combination a1 aı +... + Qn an 
composed of vectors aj, ... , Ay is called vanishing or being equal 
to zero if its value is equal to the 
null vector, i.e. if the vector b 


az in (9.1) is equal to zero. 


j C Each trivial linear combina- 

A tion is equal to zero. However, 
a3 the converse is not valid. Noe 

: each vanishing linear combina- 

Fig. 9.1 tion is trivial. In Fig. 9.1 we 

have a triangle ABC. Its sides are marked as vectors aj = AB, 
a2 = BC, and a3 = CA’ . By construction the sum of these three 


vectors a1, a2, and ag in Fig. 9.1 is zero: 


aj + a2 + ag = AB’ + BC + Cx = 0. (9.2) 
The equality (9.2) can be written as follows: 
l-ai +1-a+ 1-a = 0. (9.3) 


It is easy to see that the linear combination in the left hand side 
of the equality (9.3) is not trivial (see Definition 9.2), however, it 
is equal to zero according to the definition 9.3. 


DEFINITION 9.4. A linear combination a1 aı +... + Qn an 
composed of the vectors ai, ... , an is called non-trivial if it is 
not trivial, i.e. at least one of its coefficients @1, ... , @n is not 
equal to zero. 


§10. Linear dependence and linear independence. 
DEFINITION 10.1. A system of vectors a,,..., an is called 


linearly dependent if there is a non-trivial linear combination of 
these vectors which is equal to zero. 
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The vectors a, a2, a3 shown in Fig. 9.1 is an example of a 
linearly dependent set of vectors. 

It is important to note that the linear dependence is a property 
of systems of vectors, it is not a property of linear combinations. 
Linear combinations in the definition 10.1 are only tools for 
revealing the linear dependence. 

It is also important to note that the linear dependence, once it 
is present in a collection of vectors aj, ... , an, does not depend 
on the order of vectors in this collection. This follows from the 
fact that the value of any linear combination and its triviality or 
non-triviality are not destroyed if we transpose its summands. 


DEFINITION 10.2. A system of vectors aj,..., an is called 
linearly independent, if it is not linearly dependent in the sense of 
the definition 10.1, i.e. if there is no linear combination of these 
vectors being non-trivial and being equal to zero simultaneously. 


One can prove the existence of a linear combination with the 
required properties in the definition 10.1 by finding an example of 
such a linear combination. However, proving the non-existence in 
the definition 10.2 is more difficult. For this reason the following 
theorem is used. 


THEOREM 10.1 (linear independence criterion). A system 
of vectors a1, ... , an is linearly independent if and only if vanish- 
ing of a linear combination of these vectors implies its triviality. 


Proor. The proof is based on a simple logical reasoning. 
Indeed, the non-existence of a linear combination of the vec- 
tors aj, ..., an, being non-trivial and vanishing simultaneously 
means that a linear combination of these vectors is inevitably 
trivial whenever it is equal to zero. In other words vanishing of a 
linear combination of these vectors implies triviality of this linear 
combination. The theorem 10.1 is proved. 


THEOREM 10.2. A system of vectors aj, ... , An is linearly 
independent if and only if non-triviality of a linear combination 
of these vectors implies that it is not equal to zero. 
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The theorem 10.2 is very similar to the theorem 10.1. However, 
it is less popular and is less often used. 


EXERCISE 10.1. Prove the theorem 10.2 using the analogy 
with the theorem 10.1. 


§11. Properties of the linear dependence. 


DEFINITION 11.1. The vector b is said to be expressed as a 
linear combination of the vectors a;,..., a, if it is the value 
of some linear combination composed of the vectors aj, ... , an 
(see (9.1)). In this situation for the sake of brevity the vector 
b is sometimes said to be linearly expressed through the vectors 
aj, ... , an or to be expressed in a linear way through aj, ... , an. 


There are five basic properties of the linear dependence of 
vectors. We formulate them as a theorem. 


THEOREM 11.1. The relation of the linear dependence for a 
system of vectors possesses the following basic properties: 


1) a system of vectors comprising the null vector is linearly de- 
pendent; 

2) asystem of vectors comprising a linearly dependent subsystem 
is linearly dependent itself; 

3) if a system of vectors is linearly dependent, then at least one 
of these vectors is expressed in a linear way through other 
vectors of this system; 

4) ifa system of vectors a1, ... , an is linearly independent, while 
complementing it with one more vector a,+41 makes the system 
linearly dependent, then the vector a41 is linearly expressed 


through the vectors aj, ... , an; 

5) if a vector b is linearly expressed through some m vectors 
ai, ... , am and if each of the vectors aj, ... , Am is linearly 
expressed through some other n vectors c1, ... , Cn, then the 


vector b is linearly expressed through the vectors c1, ... , Cn- 
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The properties 1)—5) in the theorem 11.1 are relatively simple. 
Their proofs are purely algebraic, they do not require drawings. 
I do not prove them in this book since the reader can find their 
proofs in §3 of Chapter I in the book [1]. 

Apart from the properties 1)—5) listed in the theorem 11.1, 
there is one more property which is formulated separately. 


THEOREM 11.2 (Steinitz). If the vectors aq, ... , an are lin- 
early independent and if each of them is linearly expressed through 
some other vectors bı, ... , bm, then m > n. 


The Steinitz theorem 11.2 is very important in studying mul- 
tidimensional spaces. We do not use it in studying the three- 
dimensional space E in this book. 


§12. Linear dependence for n = 1. 


Let’s consider the case of a system composed of a single vector 
a, and apply the definition of the linear dependence 10.1 to this 
system. The linear dependence of such a system of one vector 
a; means that there is a linear combination of this single vector 
which is non-trivial and equal to zero at the same time: 


aya, = 0. (12.1) 


Non-triviality of the linear combination in the left hand side of 
(12.1) means that a; 4 0. Due to a; Æ 0 from (12.1) we derive 


a, =0 (12.2) 

(see Exercise 8.3). Thus, the linear dependence of a system 
composed of one vector a; yields a; = 0. 

The converse proposition is also valid. Indeed, assume that 


the equality (12.2) is fulfilled. Let’s write it as follows: 


1-a; =0. (12.3) 
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The left hand side of the equality (12.3) is a non-trivial linear 
combination for the system of one vector a; which is equal to 
zero. Its existence means that such a system of one vector is 
linearly dependent. We write this result as a theorem. 


THEOREM 12.1. A system composed of a single vector a, is 
linearly dependent if and only if this vector is zero, i.e. a; = 0. 


§13. Linear dependence for n = 2. 
Collinearity of vectors. 


Let’s consider a system composed of two vectors a; and ag. 
Applying the definition of the linear dependence 10.1 to it, we 
get the existence of a linear combination of these vectors which 
is non-trivial and equal to zero simultaneously: 


Q1 a1 + Q2 a2 = 0. (13.1) 


The non-triviality of the linear combination in the left hand side 
of the equality (13.1) means that a, # 0 or ag # 0. Since the 
linear dependence is not sensitive to the order of vectors in a 
system, without loss of generality we can assume that a, Æ 0. 
Then the equality (13.1) can be written as 


a, = —— a. (13.2) 


Let’s denote 82 = —a2/a; and write the equality (13.2) as 
aj, = Bo ag. (13.3) 


Note that the relationship (13.3) could also be derived by means 
of the item 3 of the theorem 11.1. 

According to (13.3), the vector a; is produced from the vector 
a2 by multiplying it by the number 2. In multiplying a vector 
by a number it length is usually changed (see Formulas (6.1), 
(6.2), (6.3), and Figs. 6.1, 6.2, and 6.3). As for its direction, it 
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either is preserved or is changed for the opposite one. In both 
of these cases the vector 62a, is parallel to the vector ag. If 
82 = 0m the vector 69 a2 appears to be the null vector. Such a 
vector does not have its own direction, the null vector is assumed 
to be parallel to any other vector by definition. As a result of the 
above considerations the equality (13.3) yields 


al | ag. (13.4) 


In the case of vectors for to denote their parallelism a special 
term collinearity is used. 


DEFINITION 13.1. Two free vectors a; and a> are called 
collinear, if their geometric realizations are parallel to some 
straight line common to both of them. 


As we have seen above, in the case of two vectors their 
linear dependence implies the collinearity of these vectors. The 
converse proposition is also valid. Assume that the relationship 
(13.4) is fulfilled. If both vectors a; and ag are zero, then 
the equality (13.3) is fulfilled where we can choose $2 = 1. If 
at least one the two vectors is nonzero, then up to a possible 
renumerating these vectors we can assume that ap 4 0. Having 
built geometric realizations ag = AB and a, = AC, one can 
choose the coefficient 62 on the base of the Figs. 6.1, 6.2, or 6.3 
and on the base of the formulas (6.1), (6.2), (6.3) so that the 
equality (13.3) is fulfilled in this case either. As for the equality 
(13.3) itself, we write it as follows: 


1- aı + (—B2) "a? = 0. (13.5) 


Since 1 Æ 0, the left hand side of the equality (13.5) is a non- 
trivial linear combination of the vectors a; and ag which is equal 
to zero. The existence of such a linear combination means that 
the vectors a; and ag are linearly dependent. Thus, the converse 
proposition that the collinearity of two vectors implies their linear 
dependence is proved. 
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Combining the direct and converse propositions proved above, 
one can formulate them as a single theorem. 


THEOREM 13.1. A system of two vectors a; and ag is linearly 
dependent if and only if these vectors are collinear, i.e. a, || a2. 


§14. Linear dependence for n = 3. 
Coplanartity of vectors. 


Let;s consider a system composed of three vectors aj, ag, 

and a3. Assume that it is linearly dependent. Applying the 

item 3 of the theorem 11.1 to 

this system, we get that one of 

the three vectors is linearly ex- 

pressed through the other two 

vectors. Taking into account the 

C possibility of renumerating our 

vectors, we can assume that the 

A vector a, is expressed through 
ag the vectors a> and ag: 


Fig. 14.1 a; = b2 a2 + (3 a3. (14.1) 


Let A be some arbitrary point of the space E. Let’s build 
the geometric realizations ag = AB and fa = AC’ . Then at 
the point C we build the geometric realizations of the vectors 
az = CD and $3a3 = CE. The vectors AC and CE constitute 
two sides of the triangle ACE (see Fig. 14.1). Then the sum of 
the vectors (14.1) is presented by the third side a, = AE’. 

The triangle ACE is a planar form. The geometric realizations 
of the vectors a, a2, and ag lie on the sides of this triangle. 
Therefore they lie on the plane ACE. Instead of a, = AE’ ; 

o : 
a2 = AB, and a3 = CD by means of parallel translations we can 
build some other geometric realizations of these three vectors. 
These geometric realizations do not lie on the plane ACE, but 
they keep parallelism to this plane. 


§14. LINEAR DEPENDENCE FOR n= 3. 41 


DEFINITION 14.1. Three free vectors a1, a2, and ag are called 
coplanar if their geometric realizations are parallel to some plane 
common to all three of them. 


LEMMA 14.1. The linear dependence of three vecctors aj, a2, 
a3 implies their coplanarity. 


EXERCISE 14.1. The above considerations yield a proof for the 
lemma 14.1 on the base of the formula (14.1) in the case where 


ag x 0, ag F 0, ag }{ as. (14.2) 


Consider special cases where one or several conditions (14.2) are 
not fulfilled and derive the lemma 14.1 from the formula (14.1) in 
those special cases. 


LEMMA 14.2. The coplanarity of three vectors a,, a2, a3 imply 
their linear dependence. 


PROOF. If ag = O or a3 = 0, then the propositions of the 
lemma 14.2 follows from the first item of the theorem 11.1. If 
ag x 0, a3 Æ 0, ag | a3, then the 
proposition of the lemma 14.2 
follows from the theorem 13.1 
and from the item 2 of the the- 
orem 11.1. Therefore, in or- 
der to complete the proof of the 
lemma 14.2 we should consider 
the case where all of the three 
conditions (14.2) are fulfilled. 
Fig. 14.2 Let A be some arbitrary point 


= 


of the space E. At this point 


D 


we build the geometric realizations of the vectors aj = AD, 
a2 = AC, and a3 = AB’ (see Fig. 14.2). Due to the coplanarit 
of the vectors aj, a2, and ag their geometric realizations AB, 
AC’, , and AD’ lie on a plane. Let’s denote this plane through 
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a. Through the point D we draw a line parallel to the vector 
a3 #0. Such a line is unique and it lies on the plane a. This 
line intersects the line comprising the vector ag = AC at some 
unique point F since az # 0 and ag }{ ag. Considering the points 
A, E, and D in Fig. 14.2, we derive the equality 


a= AD = AE + ED. (14.3) 


—> a 
The vector AF is collinear to the vector AC = ag Æ 0 since 
these vectors lie on the same line. For this reason there is a 
number (2 such that AE = By ag. The vector ED is collinear to 
the vector AB = a3 Æ 0 since these vectors lie on parallel lines. 
Freed ‘ : 
Hence ED = $3 a3 for some number 83. Upon substituting 


AE’ = Bp an, ED = bz a3 


into the equality (14.3) this equality takes the form of (14.1). 
The last step in proving the lemma 14.2 consists in writing the 
equality (14.1) in the following form: 


1- a; + (—62) : a2 + (—83) : a3 = 0. (14.4) 


Since 1 Æ 0, the left hand side of the equality (14.4) is a non- 
trivial linear combination of the vectors a;, a2, a3 which is equal 
to zero. The existence of such a linear combination means that 
the vectors a, a2, and ag are linearly dependent. 


The following theorem is derived from the lemmas 14.1 and 14.2. 


THEOREM 14.1. A system of three vectors aı, a2, a3 is linearly 
dependent if and only if these vectors are coplanar. 


§15. Linear dependence for n > 4. 


THEOREM 15.1. Any system consisting of four vectors aj, a2, 
a3, a4 in the space E is linearly dependent. 
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THEOREM 15.2. Any system consisting of more than four vec- 
tors in the space E is linearly dependent. 


The theorem 15.2 follows from the theorem 15.1 due to the 
item 3 of the theorem 11.1. Therefore it is sufficient to prove the 
theorem 15.1. The theorem 15.1 itself expresses a property of the 
three-dimensional space E. 


PROOF OF THE THEOREM 15.1. Let’s choose the subsystem 
composed by three vectors aj, a2, a3 within the system of 
four vectors a;, a2, a3, a4. If these three vectors are linearly 


E 


D 


Fig. 15.1 


dependent, then in order to prove the linear dependence of the 
vectors a1, a2, a3, a4 it is sufficient to apply the item 3 of the 
theorem 11.1. Therefore in what fallows we consider the case 
where the vectors a, a2, a3 are linearly independent. 

From the linear independence of the vectors aq, a2, a3, ac- 
cording to the theorem 14.1, we derive their non-coplanarity. 
Moreove, from the linear independence of aj, a2, a3 due to the 
item 3 of the theorem 11.1 we derive the linear independence of 
any smaller subsystem within the system of these three vectors. 
In particular, the vectors a, a2, ag are nonzero and the vectors 
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a; and a, are not collinear (see Theorems 12.1 and 13.1), i.e. 


ai F 0, ag x 0, a3 x 0, ay Ht ag. (15.1) 


Let A be some arbitrary point of the space E. Let’s build the 
geometric realizations aj = AB’ , a = AC, a3 = AD, ay = AE 
with the initial point A. Due to the condition a; }{ ag in (15.1) 
the vectors AB’ and AC’ define a plane (see Fig. 15.1). Let’s 
denothe this plane through a. The vector AD’ does not lie on 
the plane a and it is not parallel to this plane since the vectors 
ai, a2, ag are not coplanar. 

Let’s draw a line passing through the terminal point of the 
vector a4 = AF and being parallel to the vector a3. Since a3 }{ a, 
this line crosses the plane a at some unique point F and we have 


ey = AB = AF + FE. (15.2) 


Now let’s draw a line passing through the point F and being 
parallel to the vector ag. Such a line lies on the plane a. Due to 
a, }{ ag this line intersects the line AB at some unique point G. 
Hence we have the following equality: 


AF = AG’ + GF. (15.3) 


Note that the vector AG lies on the same line as the vector 
a, = AB. From (15.1) we get a; # 0. Hence there is a number 
3, such that AG = 3, a,. Similarly, from GF || ag and ag # 0 
we derive GF = 62a, for some number (2 and from FE || a3 
and a3 Æ O we derive that FE = 83 a3 for some number B3. The 
rest, is to substitute the obtained expressions for AG, GF , and 
FE into the formulas (15.3) and (15.2). This yields 


a4 = 6, a; + b2 a2 + p3 ag. (15.4) 
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The equality (15.4) can be rewritten as follows: 
1-aq+(—f1)- a1 + (—B2)- a2 + (—83) + a3 = 0. (15.5) 


Since 1 Æ 0, the left hand side of the equality (15.5) is a non- 
trivial linear combination of the vectors a1, a2, a3, a4 which is 
equal to zero. The existence of such a linear combination means 
that the vectors aj, a2, a3, a3 are linearly dependent. The 
theorem 15.1 is proved. 


§ 16. Bases on a line. 


Let a be some line in the space E. Let’s consider free vectors 
parallel to the line a. They have geometric realizations lying on 
the line a. Restricting the freedom of moving such vectors, i.e. 
forbidding geometric realizations outside the line a, we obtain 
partially free vectors lying on the line a. 


DEFINITION 16.1. A system consisting of one non-zero vector 
e Æ 0 lying on a line a is called a basis on this line. The vector e 
is called the basis vector of this basis. 


Let e be the basis vector of 

z some basis on the line a and let 

mee x be some other vector lying on 
this line (see Fig. 16.1). Then 

x || e and hence there is a num- 


Fig. 16.1 ber x such that the vector x is 
expressed through a by means of the formula 


x= ze. (16.1) 


The number «v in the formula (16.1) is called the coordinate of 
the vector x in the basis e, while the formula (16.1) itself is called 
the expansion of the vector x in this basis. 

When writing the coordinates of vectors extracted from their 
expansions (16.1) in a basis these coordinates are usually sur- 
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rounded with double vertical lines 
x + |z|. (16.2) 


Then these coordinated turn to matrices (see [7]). The mapping 
(16.2) implements the basic idea of analytical geometry. This 
idea consists in replacing geometric objects by their numeric pre- 
sentations. Bases in this case are tools for such a transformation. 


§17. Bases on a plane. 


Let œ be some plane in the space E. Let’s consider free vectors 
parallel to the plane a. They have geometric realizations lying on 
the plane a. Restricting the freedom of moving such vectors, i.e. 
forbidding geometric realizations outside the plane a, we obtain 
partially free vectors lying on the plane a. 


DEFINITION 17.1. An ordered pair of two non-collinear vec- 
tors e1, e2 lying on a plane a is called a basis on this plane. The 
vectors e; and eg are called the basis vectors of this basis. 


In the definition 17.1 the term “ordered system of vectors” 
is used. This term means a system of vectors in which some 
ordering of vectors is fixed: e4 is 
the first vector, e2 is the second 
vector. If we exchange the vec- 
tors e; and ey and take es for 
the first vector, while e; for the 
second vector, that would be an- 
other basis €1, €2 different from 
the basis e1, e2: 


€; = eo, @o = €]. 


Let e1, e2 be a basis on a 
Fig. 17.1 plane a and let x be some vector 
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lying on this place. Let’s choose some arbitrary point O € a and 
let’s build the geometric realizations of the three vectors e1, e2, 
and x with the initial point O: 


GT ason. x=0C. (17.1) 


Due to our choice of the vectors e1, e2, x and due to O € a the 
geometric realizations (17.1) lie on the plane a. Let’s draw a line 
passing through the terminal point of the vector x, i.e. through 
the point C, and being parallel to the vector e2 = OB. Due to 
non-collinearity of the vectors e; }{ e2 such a line intersects the 
line comprising the vector e, = OA at some unique point D (see 
Fig. 17.1). This yields the equality 


OC 30D 4DC. (17.2) 


a 
sero = 
The vector OD in (17.2) is collinear with the vector e; = OA, 


—> ; 
while the vector DC is collinear with the vector eg = OB. For 
this reason there are two numbers x, and zə such that 


OD = Tı €l, DC = T2 €2. (17.3) 


Upon substituting (17.3) into (17.2) and taking into account the 
formulas (17.1) we get the equality 


X = T1 €1 + T2 eo. (17.4) 


The formula (17.4) is analogous to the formula (16.1). The 
numbers x; and zə are called the coordinates of the vector x 
in the basis e1, e2, while the formula (17.4) itself is called the 
expansion of the vector x in this basis. When writing the 
coordinates of vectors they are usually arranged into columns 
and surrounded with double vertical lines 


Tı 


x> (17.5) 
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The column of two numbers x; and zə in (17.5) is called the 
coordinate column of the vector x. 


§ 18. Bases in the space. 


DEFINITION 18.1. An ordered system of three non-coplanar 
vectors €1, €2, e3 is called a basis in the space 


(J 


Let e1, €2, e3 be a basis in the space E and let x be some 
vector. Let’s choose some arbitrary point O and build the 
geometric realizations of all of the four vectors e1, e2, e3, and x 
with the common initial point O: 


e]; = OA, e2 = OB, ) 
18.1 
e3 = OC’, x=OD. 


The vectors eq and e> are not collinear since otherwise the 
whole system of three vectors e1, e2, e3 would be coplanar. For 
this reason the vectors e} = OA and e2 = OB define a plane 
(this plane is denoted through a 
in Fig18.1) and they lie on this 
plane. The vector e3 = OC’ does 
not lie on the plane a and it is 
not parallel to this plane (see 
Fig. 18.1). 

Let’s draw a line passing thro- 
ugh the terminal point of the 


D 


vector x = OD and being par- 
allel to the vector e3. Such a 
line is not parallel to the plane 
a since e3 Y a. It crosses the 
plane a at some unique point E. 
As a result we get the equality 


Fig. 18.1 OD =OF+ED. (18.2) 


§18. BASES IN THE SPACE. 49 


Now let’s draw a line passing through the point E and being 
parallel to the vector e2. The vectors e; and eg in (18.1) are not 
collinear. For this reason such a line crosses the line comprising 
the vector e at some unique point F. Considering the sequence 
of points O, F, E, we derive 


OF =OF +FE. (18.3) 


Combining the equalities (18.3) and (18.2), we obtain 


OD = OF + FEF + ED. (18.4) 


Note that, according to our geometric construction, the fol- 
lowing collinearity conditions are fulfilled: 


—> —> —> 
OF || e1, FE || eg, ED || es. (18.5) 


From the collinearity condition OF || e1 in (18.5) we derive the 
existence of a number x; such that OF = x, e;. From the other 
two collinearity conditions in (18.5) wec derive the existence of two 
other numbers x2 and z3 such that FE = z£ e and ED = z3 e3. 
As a result the equality (18.4) is written as 


X = T1 €1 + T2 €2 + T3 €3. (18.6) 


The formula (18.6) is analogous to the formulas (16.1) and 
(17.4). The numbers z1, £2, z3 are called the coordinates of the 
vector x in the basis e1, e2, e3, while the formula (18.6) itself is 
called the expansion of the vector x in this basis. In writing the 
coordinates of a vector they are usually arranged into a column 
surrounded with two double vertical lines: 


Tı 
x> || z2l||. (18.7) 
T3 


CopyRight © Sharipov R.A., 2010. 


50 CHAPTER I. VECTOR ALGEBRA. 


The column of the numbers 21, £2, 23 in (18.7) is called the 
coordinate column of a vector x. 


§19. Uniqueness of the expansion of a vector in a basis. 


x 


Let e1, €2, e3 be some basis in the space E. The geometric 
construction shown in Fig. 18.1 can be applied to an arbitrary 
vector x. It yields an expansion of (18.6) of this vector in the 
basis e1, €2, e3. However, this construction is not a unique way 
foe expanding a vector in a basis. For example, instead of the 
plane OAB the plane OAC can be taken for a, while the line 
can be directed parallel to the vector e2. The the line EF is 
directed parallel to the vector e3 and the point F is obtained 
in its intersection with the line OA comprising the geometric 
realization of the vector e;. Such a construction potentially 
could yield some other expansion of a vector x in the basis 
€1, €2, €3, i.e. an expansion with the coefficients different from 
those of (18.6). The fact that actually this does not happen 
should certainly be proved. 


THEOREM 19.1. The expansion of an arbitrary vector x in a 
given basis e1, €2, e3 is unique. 


PROOF. The proof is by contradiction. Assume that the 
expansion (18.6) is not unique and we have some other expansion 
of the vector x in the basis e;, eg, e3: 


X = Ťı €] + %T2€04+ £3 €3. (19.1) 


Let’s subtract th expansion (18.6) from the expansion (19.1). 
Upon collecting similar term we get 


(ži = zı) eı + (Ho = z2) e2 + (%3 = x3) e3 = 0. (19.2) 


According to the definition 18.1, the basis e1, e2, e3 is a triple 
of non-coplanar vectors. Due to the theorem 14.1 such a triple 
of vectors is linearly independent. Looking at (19.2), we see that 
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there we have a linear combination of the basis vectors e1, e2, €3 
which is equal to zero. Applying the theorem 10.1, we conclude 
that this linear combination is trivial, i.e. 


Ly =T] = 0, Ta = p = 0, T3 — £3 = 0. (19.3) 


The equalities (19.3) mean that the coefficients in the expansions 
(19.1) and (18.6) do coincide, which contradicts the assumption 
that these expansions are different. The contradiction obtained 
proves the theorem 19.1. 


EXERCISE 19.1. By analogy to the theorem 19.1 formulate and 
prove uniqueness theorems for expansions of vectors in bases on a 
plane and in bases on a line. 


§ 20. Index setting convention. 


The theorem 19.1 on the uniqueness of an expansion of vec- 
tor in a basis means that the mapping (18.7), which associates 
vectors with their coordinates in some fixed basis, is a bijective 
mapping. This makes bases an important tool for quantitative 
description of geometric objects. This tool was improved sub- 
stantially when a special index setting convention was admitted. 
This convention is known as Einstein’s tensorial notation. 


DEFINITION 20.1. The index setting convention, which is also 
known as Einstein’s tensorial notation, is a set of rules for placing 
indices in writing components of numeric arrays representing 
various geometric objects upon choosing some basis or some 
coordinate system. 


Einstein’s tensorial notation is not a closed set of rules. When 
new types of geometric objects are designed in science, new rules 
are added. For this reason below I formulate the index setting 
rules as they are needed. 
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DEFINITION 20.2. Basis vectors in a basis are enumerated by 
lower indices, while the coordinates of vectors expanded in a 
basis are enumerated by upper indices. 


The rule formulated in the definition 20.2 belongs to Einstein’s 
tensorial notation. According to this rule the formula (18.6) 
should be rewritten as 


3 
x= q! e.+2°e,+2%e3 = 2'e;, (20.1) 
i=l 


whire the mapping (18.7) should be written in the following way: 
xe |} x? |). (20.2) 


EXERCISE 20.1. The rule from the definition 20.2 is applied 


for bases on a line and for bases on a plane. Relying on this rule 
rewrite the formulas (16.1), (16.2), (17.4), and (17.5). 


§ 21. Usage of the coordinates of vectors. 


Vectors can be used in solving various geometric problems, 
where the basic algebraic operations with them are performed. 
These are the operation of vector addition and the operation of 
multiplication of vectors by numbers. The usage of bases and 
coordinates of vectors in bases relies on the following theorem. 


THEOREM 21.1. Let some basis e1, €2, e3 in the space E be 
chosen and fixed. In this situation when adding vectors their co- 
ordinates are added, while when multiplying a vector by a number 
its coordinates are multiplied by this number, i.e. if 


x! yt 
xe || x? |], yoy ll, (21.1) 
3 3 
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then for x + y and a- x we have the relationships 


al +y! agl 
x+y |z? +y? Il, a-x l|ar? ||. (21.2) 
r3 + y3 enue 


EXERCISE 21.1. Prove the theorem 21.1, using the formulas 
(21.1) and (21.2) for this purpose, and prove the theorem 19.1. 


EXERCISE 21.2. Formulate and prove theorems analogous to 
the theorem 21.1 in the case of bases on a line and on a plane. 


§22. Changing a basis. Transition formulas 
and transition matrices. 


When solving geometric problems sometimes one needs to 
change a basis replacing it with another basis. Let e1, e2, e3 and 
č1, €2, čą be two bases in the space E. If the basis e1, e2, e3 
is replaced by the basis €1, č2, č, then e1, e2, e3 is usually 
called the old basis, while €1, č2, č is called the new basis. 
The procedure of changing an old basis for a new one can be 
understood as a transition from an old basis to a new basis or, in 
other words, as a direct transition. Conversely, changing a new 
basis for an old one is understood as an inverse transition. 

Let e1, e2, e3 and č, č2, €3 be two bases in the space E, 
where e1, €2, e3 is an old basis and ej, €2, č is a new basis. 
In the direct transition procedure vectors of a new basis are 
expanded in an old basis, i.e. we have 


e) = Si e; + S? e2 a S? e3, 
eo = ce eı + S5 e2 T ae €3, (22:1) 


~ 1 2 3 
e3 = S3 €i + S3 e2 T S3 e3. 


The formulas (22.1) are called the direct transition formulas. The 
numeric coefficients Si, $?, 9} in (22.1) are the coordinates 
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of the vector č expanded in the old basis. According to the 
definition 20.2, they are enumerated by an upper index. The 
lower index 1 of them is the number of the vector & of which 
they are the coordinates. It is used in order to distinguish the 
coordinates of the vector €; from the coordinates of č and é3 in 
the second and in the third formulas (22.1). 

Let’s apply the first mapping (20.2) to the transition formulas 
and write the coordinates of the vectors €;, €2, €3 as columns: 


Si S2 93 
& e || S?2 |], č > || S2 |, 63 > || S2 ||. (22.2) 
St 55 93 


The columns (22.2) are usually glued into a single matrix. Such 
a matrix is naturally denoted through S: 


Si 92 S3 
S= || S? S2 Ge ll: (22.3) 
St S3 93 


DEFINITION 22.1. The matrix (22.3) whose components are 
determined by the direct transition formulas (22.1) is called the 
direct transition matriz. 


Note that the components of the direct transition matrix Si 
are enumerated by two indices one of which is an upper index, 
while the other is a lower index. These indices define the position 
of the element Si in the matrix S: the upper index i is a row 
number, while the lower index j is a column number. ‘This 
notation is a part of a general rule. 


DEFINITION 22.2. If elements of a double index array are 
enumerated by indices on different levels, then in composing 
a matrix of these elements the upper index is used as a row 
number, while the lower index is used a column number. 
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DEFINITION 22.3. If elements of a double index array are 
enumerated by indices on the same level, then in composing a 
matrix of these elements the first index is used as a row number, 
while the second index is used a column number. 


The definitions 22.2 and 22.3 can be considered as a part of 
the index setting convention from the definition 20.1, though 
formally they are not since they do not define the positions of 
array indices, but the way to visualize the array as a matrix. 

The direct transition formulas (22.1) can be written in a 
concise form using the summation sign for this purpose: 


3 
čj = > 5 e;, where j = 1, 2, 3. (22.4) 
i=1 


There is another way to write the formulas (22.1) concisely. It is 


based on the matrix multiplication (see [7]): 
St we. Oe 
[č č č||=]ļe1 e2 esl- |S? SZ $3]. (22.5) 
S? 8. S3 


Note that the basis vectors &1, č2, €3 and e1, e2, e3 in the for- 
mula (22.5) are written in rows. This fact is an instance of a 
general rule. 


DEFINITION 22.4. If elements of a single index array are 
enumerated by lower indices, then in matrix presentation they 
are written in a row, i.e. they constitute a matrix whose height 
is equal to unity. 


DEFINITION 22.5. If elements of a single index array are 
enumerated by upper indices, then in matrix presentation they 
are written in a column, i.e. they constitute a matrix whose 
width is equal to unity. 
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Note that writing the components of a vector x as a col- 
umn in the formula (20.2) is concordant with the rule from the 
definition 22.5, while the formula (18.7) violates two rules at 
once — the rule from the definition 20.2 and the rule from the 
definition 22.5. 

Now let’s consider the inverse transition from the new basis 
€1, G2, č to the old basis e1, e2, e3. In the inverse transition 
procedure the vectors of an old basis are expanded in a new basis: 


ey = Ti ej T? &2 To e3, 
eo = T} 6, + T3 č + T3 Ss, (22.6) 


e3 = T3 č + Ts č + T3 8s. 


The formulas (22.6) are called the inverse transition formulas. 
The numeric coefficients in the formulas (22.6) are coordinates of 
the vectors e1, €2, e3 in their expansions in the new basis. These 
coefficients are arranged into columns: 


T} T} T} 
e, +> || T? |, e2 || TF ||, e3 +> || T? ||. (22.7) 
i i T3 
Then the columns (22.7) are united into a matrix: 
TE Ta T} 
Tare T TN- (22.8) 
T Ue T3 


DEFINITION 22.6. The matrix (22.8) whose components are 
determined by the inverse transition formulas (22.6) is called the 
inverse transition matriz. 

The inverse transition formulas (22.6) have a concise form, 
analogous to the formula (22.4): 

3 
e; =) T}é;, where j= 1, 2, 3. (22.9) 
i=1 


CopyRight © Sharipov R.A., 2010. 


§23. SOME INFORMATION ... 57 


There is also a matrix form of the formulas (22.6): 


Ti Tz Ts 
| ey €2 €&3 | = | el eo e€3 | T? To Ts ‘ (22.10) 
TETI 


The formula (22.10) is analogous to the formula (22.5). 


EXERCISE 20.1. By analogy to (22.1) and (22.6) write the 
transition formulas for bases on a plane and for bases on a line 
(see Definitions 16.1 and 17.1). Write also the concise and matrix 
versions of these formulas. 


§ 23. Some information on transition matrices. 


THEOREM 23.1. The matrices S and T whose components are 
determined by the transition formulas (22.1) and (22.6) are inverse 
to each other, i.e. T = S~! and S =T—}. 


I do not prove the theorem 23.1 in this book. The reader can 
find this theorem and its proof in [1]. 

The relationships T = S~! and S = T7! from the theo- 
rem 23.1 mean that the product of S by T and the product of T 
by S both are equal to the unit matrix (see [7]): 


S-T=1 T-S=1. (23.1) 
Let’s recall that the unit matrix is a square n x n matrix that 
has ones on the main diagonal and zeros in all other positions. 


Such a matrix is often denoted through the same symbol 1 as the 
numeric unity. Therefore we can write 


10 0 
1=|[0 1 oll. (23.2) 
001 
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In order to denote the components of the unit matrix (23.2) the 
symbol 6 is used. The indices enumerating rows and columns can 
be placed either on the same level or on different levels: 


ig ; 1 for i=j; 
ee ee { aa (23.3) 
0 for ify. 
DEFINITION 23.1. The double index numeric array 6 deter- 
mined by the formula (23.3) is called the Kronecker symbol! or 
the Kronecker delta. 


The positions of indices in the Kronecker symbol are deter- 
mined by a context where it is used. For example, the relation- 
ships (23.1) can be written in components. In this particular case 
the indices of the Kronecker symbol are placed on different levels: 


3 3 
so =o, YTS =S (23.4) 
j=1 j=1 


Such a placement of the indices in the Kronecker symbol in (23.4) 
is inherited from the transition matrices S and T. 

Noter that the transition matrices S and T are square ma- 
trices. For such matrices the concept of the determinant is 
introduced (see [7]). This is a number calculated through the 
components of a matrix according to some special formulas. In 
the case of the unit matrix (23.2) these formulas yield 


det1 = 1. (23.5) 


The following fact is also well known. Its proof can be found in 
the book [7]. 


THEOREM 23.2. The determinant of a product of matrices is 
equal to the product of their determinants. 


1 Don’t mix with the Kronecker symbol used in number theory (see [8]). 
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Let’s apply the theorem 23.2 and the formula (23.5) to the 
products of the matrices S and T in (23.1). This yields 


det S-detT = 1. (23.6) 


DEFINITION 23.2. A matrix with zero determinant is called 
degenerate. If the determinant of a matrix is nonzero, such a 
matrix is called non-degenerate. 


From the formula (23.6) and the definition 23.2 we immedi- 
ately derive the following theorem. 


THEOREM 23.3. For any two bases in the space E the corre- 
sponding transition matrices S and T are non-degenerate and the 
product of their determinants is equal to the unity. 


THEOREM 23.4. Each non-degenerate 3 x 3 matrix S is a tran- 
sition matrix relating some basis e,, e2, e3 in the space E with 
some other basis €1, €2, €3 in this space. 


The theorem 23.4 is a strengthened version of the theo- 
rem 23.3. Its proof can be found in the book [1]. 


EXERCISE 23.1. Formulate theorems analogous to the theo- 
rems 23.1, 23.2, and 23.4 in the case of bases on a plane and in 
the case of bases on a line. 


§ 24. Index setting in sums. 


As we have already seen, in dealing with coordinates of vectors 
formulas with sums arise. It is convenient to write these sums 
in a concise form using the summation sign. Writing sums in 
this way one should follow some rules, which are listed below as 
definitions. 


DEFINITION 24.1. Each summation sign in a formula has its 
scope. This scope begins immediately after the summation sign 
to the right of it and ranges up to some delimiter: 


1) the end of the formula; 
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2) the equality sign; 

3) the plus sign «+» or the minus sign «—» not enclosed into 
brackets opened after a summation sign in question; 

4) the closing bracket whose opening bracket precedes the sum- 
mation sign in question. 


Let’s recall that a summation sign is present in a formula and 
if some variable is used as a cycling variable in this summation 
sign, such a variable is called a summation index (see Formula 
(8.3) and the comment to it). 


DEFINITION 24.2. Each summation index can be used only 
within the scope of the corresponding summation sign. 


Apart from simple sums, multiple sums can be used in formu- 
las. They obey the following rule. 


DEFINITION 24.3. A variable cannot be used as a summation 
index in more than one summation signs of a multiple sum. 


DEFINITION 24.4. Variables which are not summation indices 
are called free variables. 


Summation indices as well as free variables can be used as 
indices enumerating basis vectors and array components. The 
following terminology goes along with this usage. 


DEFINITION 24.5. A free variable which is used as an index is 
called a free indez. 


In the definitions 24.1, 24.2 and 24.3 the commonly admit- 
ted rules are listed. Apart from them there are more special 
rules which are used within the framework of Einstein’s tensorial 
notation (see Definition 20.1). 


DEFINITION 24.6. If an expression is a simple sum or a multi- 
ple sum and if each summand of it does not comprise other sums, 
then each free index should have exactly one entry in this ex- 
pression, while each summation index should enter twice — once 
as an upper index and once as a lower index. 
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DEFINITION 24.7. The expression built according to the def- 
inition 24.6 can be used for composing sums with numeric coef- 
ficients. Then all summands in such sums should have the same 
set of free indices and each free index should be on the same level 
(upper or lower) in all summands. Regardless to the number of 
summands, in counting the number of entries to the whole sum 
each free index is assumed to be entering only once. The level 
of a free index in the sum (upper or lower) is determined by its 
level in each summand. 


Lets consider some expressions as examples: 


3 3 3s .3 
Sia bi, Soa Giz Se b? gik, (24.1) 
i=1 i=1 i=1 k=1 
3 3 3% 3 
S 2A barat > a0 yy O Be (242) 
k=1 j=1 r=1s=1 


EXERCISE 24.1. Verify that each expression in (24.1) satisfies 
the definition 24.6, while the expression (24.2) satisfies the de- 
finition 24.7. 


DEFINITION 24.8. Sums composed according to the defini- 
tion 24.7 can enter as subexpressions into simple and multiple 
sums which will be external sums with respect to them. Then 
some of their free indices or all of their free indices can turn into 
summation indices. Those of free indices that remain free are 
included into the list of free indices of the whole expression. 

In counting the number of entries of an index in a sum 
included into an external simple or multiple sum the rule from 
the definition 24.7 is applied. Taking into account this rue, each 
free index of the ultimate expression should enter it exactly once, 
while each summation index should enter it exactly twice — 
once as an upper index and once as a lower index. In counting 
the number of entries of an index in a sum included into an 
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external simple or multiple sum the rule from the definition 24.7 
is applied. Taking into account this rue, each free index of 
the ultimate expression should enter it exactly once, while each 
summation index should enter it exactly twice — once as an 
upper index and once as a lower index. In counting the number 
of entries of an index in a sum included into an external simple or 
multiple sum the rule from the definition 24.7 is applied. Taking 
into account this rue, each free index of the ultimate expression 
should enter it exactly once, while each summation index should 
enter it exactly twice — once as an upper index and once as a 
lower index. In counting the number of entries of an index in 
a sum included into an external simple or multiple sum the rule 
from the definition 24.7 is applied. Taking into account this rue, 
each free index of the ultimate expression should enter it exactly 
once, while each summation index should enter it exactly twice 
— once as an upper index and once as a lower index. 


As an example we consider the following expression which 
comprises inner and outer sums: 


3 3 
> Ai (2; +5 ch u’) . (24.3) 
k=1 i=1 


EXERCISE 24.2. Make sure that the expression (24.3) satisfies 
the definition 24.8. 


EXERCISE 24.3. Open the brackets in (24.3) and verify that 
the resulting expression satisfies the definition 24.7. 


The expressions built according to the definitions 24.6, 24.7, 
and 24.8 can be used for composing equalities. In composing 
equalities the following rule is applied. 


DEFINITION 24.9. Both sides of an equality should have the 
same set of free indices and each free index should have the same 
position (upper or lower) in both sides of an equality. 
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§ 25. Transformation of the coordinates 
of vectors under a change of a basis. 


Let e1, e2, e3 and €1, €2, €3 be two bases in the space E and 
let e1, €2, e3 be changed for the basis €;, €2, €3. As we already 
mentioned, in this case the basis e1, e2, e3 is called an old basis, 
while €;, €2, čą is called a new basis. 

Let’s consider some arbitrary vector x in the space E. Ex- 
panding this vector in the old basis e;, e2, e3 and in the new 
basis €;, €2, €3, we get two sets of its coordinates: 


1 


£ x 
xe |x’ |, xe |2|. (25.1) 
r? z3 


Both mappings (25.1) are bijective. For this reason there is a 
bijective correspondence between two sets of numbers zt, x”, x? 
and £!, #7, Z3. In order to get explicit formulas expressing 
the coordinates of the vector x in the new basis through its 


coordinates in the old basis we use the expansion (20.1): 
3 . 
x=) ae): (25.2) 
j=l 


Let’s apply the inverse transition formula (22.9) in order to 
express the vector e; in (25.2) through the vectors of the new 
basis €), €2, 63. Upon substituting (22.9) into (25.2), we get 


The formula (25.3) expresses the vector x as a linear combination 
of the basis vectors €1, €2, 63, i.e. it is an expansion of the vector 
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x in the new basis. Due to the uniqueness of the expansion of a 
vector in a basis (see Theorem 19.1) the coefficients of such an 
expansion should coincide with the coordinates of the vector x in 
the new basis €1, €2, €3: 


3 
zi = yT x), there i= l; 2:3: (25.4) 
j=l 


The formulas (25.4) expressing the coordinates of an arbitrary 
vector x in a new basis through its coordinates in an old basis 
are called the direct transformation formulas. Accordingly, the 
formulas expressing the coordinates of an arbitrary vector x in 
an old basis through its coordinates in a new basis are called the 
inverse transformation formulas. The latter formulas need not 
be derived separately. It is sufficient to move the tilde sign from 
the left hand side of the formulas (25.4) to their right hand side 
and replace T with S F: This yields 


3 
ri = DEn. where i= 1, 2, 3. (25.5) 
The direct transformation formulas (25.4) have the expanded 
form where the summation is performed explicitly: 
1 T} x! T x T4 r’, 


P =T r Hr tie, (25.6) 


z3 — T3 yl 3 2 $3 
welir tir +I a: 


R 


The same is true for the inverse transformation formulas (25.5): 
zt = Sil +5,0 +542, 
r? = R T + 922° + S32, (25.7) 


r? = By + S32? + S3 2°. 
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Along with (25.6), there is the matrix form of the formulas (25.4): 


fia Ti Tz T3|| jæ! 
#\|=||T2 T2 T2|/-|)2? (25.8) 
z3 3 


Tp T? T3i '* 


Similarly, the inverse transformation formulas (25.5), along with 
the expanded form (25.7), have the matrix form either 


xt St S2 S3 zl 
eS a Ge fe (25.9) 
3 x3 
wll sg sg sgl Ne 


EXERCISE 25.1. Write the analogs of the transformation for- 
mulas (25.4), (25.5), (25.6), (25.7), (25.8), (25.9) for vectors on a 
plane and for vectors on a line expanded in corresponding bases 
on a plane and on a line. 


§ 26. Scalar product. 


Let a and b be two nonzero free vectors, Let’s build their 
geometric realizations a = OA and b = OB at some arbitrary 
point O. The smaller of two angles 

formed by the rays [OA) and [OB) at 

A the point O is called the angle between 
B vectors OA and OB. In Fig. 26.1 this 


a P angle is denoted through y. The value 
b of the angle y ranges from 0 to 7: 
j O<pRn. 
Fig. 26.1 


The lengths of the vectors Ox and 
OB do not depend on the choice of a point O (see Definitions 3.1 
and 4.2). The same is true for the angle between them. Therefore 
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we can deal with the lengths of the free vectors a and b and with 
the angle between them: 


la| = [OA], lb| = |OB], ab = AOB = ọ. 


In the case where a = 0 or b = 0 the lengths of the vectors a and 
b are defined, but the angle between these vectors is not defined. 


DEFINITION 26.1. The scalar product of two nonzero vectors 
a and b is a number equal to the product of their lengths and 
the cosine of the angle between them: 


(a, b) = |a| |b] cos y. (26.1) 


In the case where a = 0 or b = 0 the scalar product (a,b) is 
assumed to be equal to zero by definition. 


A comma is the multiplication sign in the writing the scalar 
product, not by itself, but together with round brackets surround- 
ing the whole expression. These brackets are natural delimiters 
for multiplicands: the first multiplicand is an expression between 
the opening bracket and the comma, while the second multipli- 
cand is an expression between the comma and the closing bracket. 
Therefore in complicated expressions no auxiliary delimiters are 
required. For example, in the formula 


(a+b,c+d) 


the sums a+b and c + d are calculated first, then the scalar 
multiplication is performed. 

A remark. Often the scalar product is written as a- b. Even 
the special term «dot product» is used. However, to my mind, 
this notation is not good. It is misleading since the dot sign is 
used for denoting the product of a vector and a number and for 
denoting the product of two numbers. 


§27. ORTHOGONAL PROJECTION ONTO A LINE. 67 


§27. Orthogonal projection onto a line. 


Let a and b be two free vectors such that a £ 9. Let’s 
build their geometric realizations a= OA and b = OB at some 
arbitrary point O. The nonzero 
vector OA defines a line. Let’s 
drop the perpendicular from the 
terminal point of the vector OB, 
i.e. from the point B, to this line 
and let’s denote through C the 


A ao bi C base of this perpendicular (see 
Fig. 27.1). In the special case 
Fig. 27.1 where b || a and where the point 


B lies on the line OA we choose 
the point C coinciding with the point B. 
x ; =F = 
The point C determines two vectors OC and CB. ‘The 


vector OC is collinear to the vector a, while the vector CB is 
perpendicular to it. By means of parallel translations one can 
replicate the vectors oc and CB up to free vectors by and b, 
respectively. Note that the point C is uniquely determined by 
the point B and by the line OA (see Theorem 6.5 in Chapter III 
of the book [7]). For this reason the vectors by and b, do not 
depend on the choice of a point O and we can formulate the 
following theorem. 


THEOREM 27.1. For any nonzero vector a # 0 and for any 
vector b there two unique vectors b and b, such that the vector 
b; is collinear to a, the vector b, is perpendicular to a, and they 
both satisfy the equality being the expansion of the vector b: 


One should recall the special case where the point C coincides 
with the point B. In this case b, = O and we cannot verify 
visually the orthogonality of the vectors b, and a. In order 
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to extend the theorem 27.1 to this special case the following 
definition is introduced. 


DEFINITION 27.1. All null vectors are assumed to be per- 
pendicular to each other and each null vector is assumed to be 
perpendicular to any nonzero vector. 


Like the definition 3.2, the definition 27.1 is formulated for 
geometric vectors. Upon passing to free vectors it is convenient 
to unite the definition 3.2 with the definition 27.1 and then 
formulate the following definition. 


DEFINITION 27.2. A free null vector 0 of any physical nature 
is codirected to itself and to any other vector. A free null vector 
0 of any physical nature is perpendicular to itself and to any 
other vector. 


When taking into account the definition 27.2, the theorem 27.1 
is proved by the constructions preceding it, while the expansion 
(27.1) follows from the evident equality 


OB = 0C + 0B. 


Assume that a vector a Æ 0 is fixed. In this situation the 
theorem 27.1 provides a mapping 7, that associates each vector 
b with its parallel component b}. 


DEFINITION 27.3. The mapping Ta that associates each free 
vector b with its parallel component b, in the expansion (27.1) 
is called the orthogonal projection onto a line given by the vector 
a # 0 or, more exactly, the orthogonal projection onto the direc- 
tion of the vector a # 0. 


The orthogonal projection ma is closely related to the scalar 
product of vectors. This relation is established by the following 
theorem. 


THEOREM 27.2. For each nonzero vector a # 0 and for any 
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vector b the vector ta(b) is calculated by means of the formula 


Ta(b) = ae a. (27.2) 


PROOF. If b = 0 both sides of the equality (27.2) do vanish 
and it is trivially fulfilled. Therefore we can assume that b Æ 0. 

It is easy to see that the vectors in two sides of the equality 
(27.2) are collinear. For the beginning let’s prove that the lengths 
of these two vectors are equal to each other. The length of the 
vector Ta(b) is calculated according to Fig. 27.1: 


|7a(b)| = [b| = [b] | cos yl. (27.3) 


The length of the vector in the right hand side of the formula 
(27.2) is determined by the formula itself: 


_ [bl lal cos | 


= |b] | cos ọ]. (27.4) 


a) _ Wall 


lal? lal 


(27.5) 
Due to (27.5) in order to prove the equality (27.2) it is 
sufficient to prove the codirectedness of vectors 


(b,a) 
az * 


Talb) Tt (27.6) 


Since ma(b) = b}, again applying Fig. 27.1, we consider the 
following three possible cases: 


0< g< 7/2, p =n/2, q(2 Sips a, 
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In the first case both vectors (27.6) are codirected with the vector 
a #0. Hence they are codirected with each other. 

In the second case both vectors (27.6) are equal to zero. They 
are codirected according to the definition 3.2. 

In the third case both vectors (27.6) are opposite to the vector 
a#0. Therefore they are again codirected with each other. The 
relationship (27.6) and the theorem 27.2 in whole are proved. O 


DEFINITION 27.4. A mapping f acting from the set of all free 
vectors to the set of all free vectors is called a linear mapping if 
it possesses the following two properties: 


1) fla+b) = f(a) + f(b); 

2) flaa) =a f(a). 
The properties 1) and 2), which should be fulfilled for any two 
vectors a and b and for any number a, constitute a property 
which is called the linearity. 


THEOREM 27.3. For any nonzero vector a Æ 0 the orthogonal 
projection Ta onto a line given by the vector a is a linear mapping. 


In order to prove the theorem 27.3 we need the following 
auxiliary lemma. 


D 


LEMMA 27.1. For any nonze- 
ro vector a # 0 the sum of two 
vectors collinear to a is a vector 
collinear to a and the sum of two 
vectors perpendicular to a is a 
vector perpendicular to a. 


A 


PROOF OF THE LEMMA 27.1. 
The first proposition of the lem- 
ma is obvious. It follows imme- 
diately from the definition 5.1. 

Let’s prove the second propo- 
sition. Let b and c be two vec- 
Fig. 27.2 tors, such that b L a and c La. 
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In the cases b = 0, c = 0, b + c = O, and in the case b || c the 
second proposition of the lemma is also obvious. In these cases 
geometric realizations of all the three vectors b, c, and b+ c can 
be chosen such that they lie on the same line. Such a line is 
perpendicular to the vector a. 

Let’s consider the case where b }{ c. Let’s build a geometric 
realization of the vector b = OB with initial point at some 
arbitrary point O. Then we lay the vector c = BC at the 
terminal point B of the vector OB’ Since b } c, the points O, B, 
and C do not lie on a single straight line altogether. Hence they 
determine a plane. We denote this plane through a. The sum 


of vectors OC’ =0B + BC lies on this plane. It is a geometric 
realization for the vector b + c, i..e. OG = b+c. 

We build two geometric realizations OA and BD for the 
vector a. These are two different geometric vectors. From the 
equality OA = BD we derive that the lines OA and BD are 
parallel. Due to b L a and c L a the line BD is perpendicular to 
the pair of crosswise intersecting lines OB and BC lying on the 
plane a. Hence it is perpendicular to this plane. From BD L a 
and BD || OA we derive OA L a, while from OA L a we derive 
OA l OC. . Hence the sum of vectors b + c is perpendicular to 
the vector a. The lemma 27.1 is proved. 


PROOF OF THE THEOREM 27.3. According to the definition 
27.4, in order to prove the theorem we need to verify two linearity 
conditions for the mapping ma. The first of these conditions in 
our particular case is written as the equality 


Talb + c) = Ta(b) + Ta (c). (27.7) 


Let’s denote d = b +c. According to the theorem 27.1, there are 
expansions of the form (27.1) for the vectors b, c, and d: 


c=c,+c,, (27.9) 
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d=d,+d,. (27.10) 


According the same theorem 27.1, the components of the expan- 
sions (27.8), (27.9), (27.10) are uniquely fixed by the conditions 


b, || a, b, La, (27.11) 
cı || a, c, La, (27.12) 
d, || a, d, La. (27.13) 


Adding the equalities (27.8) and (27.9), we get 
d=b+c=(b,+c,) + (bi +c). (27.14) 


Due to (27.11) and (27.12) we can apply the lemma 27.1 to the 
components of the expansion (27.14). This yields 


(b, + cı) Il a, (bi + c) ta. (27.15) 


The rest is to compare (27.14) with (27.10) and (27.15) with 
(27.13). From this comparison, applying the theorem 27.1, we 
derive the following relationships: 


dı = bı +€, d, =b, +c.. (27.16) 


According to the definition 27.3, the first of the above relation- 
ships (27.16) is equivalent to the equality (27.7) which was to be 
verified. 

Let’s proceed to proving the second linearity condition for the 
mapping Ta. It is written as follows: 


Tala b) = a Ta(b). (27.17) 
Let’s denote e = a b and then, applying the theorem 27.1, write 


e=e,+e,. (27.19) 
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According to the theorem 27.1, the components of the expansions 
(27.18) and (27.19) are uniquely fixed by the conditions 


b, || a, b, La, (27.20) 
e; || a, e, La. (27.21) 


Let’s multiply both sides of (27.18) by a. Then we get 
e=ab=ab,+ab,. (27.22) 


Multiplying a vector by the number a, we get a vector collinear 
to the initial vector. For this reason from (27.20) we derive 


(abı) || a, (ab,) La, (27.23) 


Let’s compare (27.22) with (27.19) and (27.23) with (27.21). 
Then, applying the theorem 27.1, we obtain 


ej =ab,, €y =ab,. (27.24) 
According to the definition 27.3, the first of the equalities (27.24) 


is equivalent to the required equality (27.17). The theorem 27.3 
is proved. 


§ 28. Properties of the scalar product. 


THEOREM 28.1. The scalar product of vectors possesses the 
following four properties which are fulfilled for any three vectors 
a, b, c and for any number a: 


I 

2) ( 

3) (~a, c) =a (a,c); 

4) (a,a) > 0 and (a,a) = 0 implies a = 0. 
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DEFINITION 28.1. The property 1) in the theorem 28.1 is 
called the property of symmetry; the properties 2) and 3) are 
called the properties of linearity with respect to the first multipli- 
cand; the property 4) is called the property of positivity. 


PROOF OF THE THEOREM 28.1. The property of symmetry 1) 
is immediate from the definition 26.1 and the formula (26.1) in 
this definition. Indeed, if one of the vectors a or b is equal to 
zero, then both sides of the equality (a,b) = (b,a) equal to zero. 
Hence the equality is fulfilled in this case. 

In the case of the nonzero vectors a and b the angle ọ is 
determined by the pair of vectors a and b according to Fig. 26.1, 
it does not depend on the order of vectors in this pair. Therefore 
the equality (a,b) = (b,a) in this case is reduced to the equality 


|a| |b| cos y = |b] |a| cos ọ. 


It is obviously fulfilled since |a| and |b| are numbers comple- 
mented by some measure units depending on the physical nature 
of the vectors a and b. 

Let’s consider the properties of linearity 2) and 3). If c = 0, 
then both sides of the equalities (a + b,c) = (a,c) + (b,c) and 
(aa,b) = a (a,b) are equal to zero. Hence these equalities are 
fulfilled in this case. 

If c £0, then we apply the theorems 27.2 and 27.3. From the 
theorems 27.2 we derive the following equalities: 


Tela + b) — Teha) — te(b) = — c, 


(aa,c)-— a (a,c) 


Te(@a) — artela) = RE 


Due to the theorem 27.3 the mapping Te is a linear mapping 
(see Definition 27.4). Therefore the left hand sides of the above 
equalities are zero. Now due to c Æ 0 we conclude that the 


§29.... IN A SKEW-ANGULAR BASIS. 75 


numerators of the fractions in their right hand sides are also zero. 
This fact proves the properties 2) and 3) from the theorem 28.1 
are valid in the case c Æ 0. 

According to the definition 26.1 the scalar product (a,a) is 
equal to zero for a = 0. Otherwise, if a 4 0, the formula (26.1) 
is applied where we should set b = a. This yields y = 0 and 


(a,a) = |al? > 0. 


This inequality proves the property 4) and completes the proof 
of the theorem 28.1 in whole. 


THEOREM 28.2. Apart from the properties 1)-4), the scalar 
product of vectors possesses the following two properties fulfilled 
for any three vectors a, b, c and for any number a: 


5) (c,a+b) = (c,a) + (a,b); 
6) (c,aa) =a(c,a). 


DEFINITION 28.2. The properties 5) and 6) in the theo- 
rem 28.2 are called the properties of linearity with respect to 
the second multiplicand. 


The properties 5) and 6) are easily derived from the properties 
2) and 3) by applying the property 1). Indeed, we have 


(c,a+ b) = (a + b,c) = (a,c) + (b,c) = (c,a) + (c, b), 


(c,aa) = (aa,c) =a(a,c) =a(c,a). 
These calculations prove the theorem 28.2. 


§29. Calculation of the scalar product through the 
coordinates of vectors in a skew-angular basis. 


Let e1, e2, e3 be some arbitrary basis in the space E. Accord- 
ing to the definition 18.1, this is an ordered triple of non-coplanar 
vectors. The arbitrariness of a basis means that no auxiliary re- 
strictions are imposed onto the vectors e1, e2, e3, except for 
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non-coplanarity. In particular, this means that the angles be- 
tween the vectors e;, e2, e3 in an arbitrary basis should not be 
right angles. For this reason such a basis is called a skew-angular 
basis and abbreviated as SAB. 


DEFINITION 29.1. In this book a skew-angular basis (SAB) is 
understood as an arbitrary basis. 


Thus, let e1, e2, eg be some skew-angular basis in the space E 
and let a and b be two free vectors given by its coordinates in 
this basis. We write this fact as follows: 


al bt 
a= || all, b = || 0? (29.1) 
a? b3 


Unlike (21.1), instead of the arrow sign in (29.1) we use the 
equality sign. Doing this, we emphasize the fact that once a basis 
is fixed, vectors are uniquely identified with their coordinates. 

The conditional writing (29.1) means that the vectors a and b 
are presented by the following expansions: 


3 3 
a=) de; b=) bej. (29.2) 
i=1 j=l 


Substituting (29.2) into the scalar product (a,b), we get 


(a,b) = (Ze ei, ye e). (29.3) 


In order to transform the formulas (29.3) we apply the properties 
2) and 5) of the scalar product from the theorems 28.1 and 28.2. 
Due to these properties we can take the summation signs over i 
and j out of the brackets of the scalar product: 


3 3 


(a,b) = Ds (a’ e;, bt e;). (29.4) 
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Then we apply the properties 3) and 6) from the theorems 28.1 
and 28.2. Due to these properties we can take the numeric factors 
a’ and bÎ out of the brackets of the scalar product in (29.4): 


3 3 
(a,b) = de bÍ (e;, €;). (29.5) 


The quantities (e;,e;) in the formula (29.5) depend on a basis 
€1, €2, €3, namely on the lengths of the basis vectors and on the 
angles between them. They do not depend on the vectors a and 
b. The quantities (e;,e;) constitute an array of nine numbers 


enumerated by two lower indices. The components of the array 
(29.6) are usually arranged into a square matrix: 


gir 912 913 
G = || 92 922 923 (29.7) 


931 932 933 


DEFINITION 29.2. The matrix (29.7) with the components 
(29.6) is called the Gram matrix of a basis e1, e2, e3. 


Taking into account the notations (29.6), we write the formula 
(29.5) in the following way: 


3 3 
(a,b) = SoS a b gis. (29.8) 
i=1 j=1 


DEFINITION 29.3. The formula (29.8) is called the formula for 
calculating the scalar product through the coordinates of vectors in 
a skew-angular basis. 
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The formula (29.8) can be written in the matrix form 


gu 912 913 b1 
(a,b) =|jat a? a? ||- |f ga g22 923 |} - I0? (29.9) 
3 
931 932 933 b 
Note that the coordinate column of the vector b in the formula 
(29.9) is used as it is, while the coordinate column of the vector 


a is transformed into a row. Such a transformation is known as 
matrix transposing (see [7]). 


DEFINITION 29.4. A transformation of a rectangular matrix 
under which the element in the intersection of i-th row and j-th 
column is taken to the intersection of j-th row and i-th column is 
called the matrix transposing. It is denoted by means of the sign 
T. In the TeX and LaTẸX computer packages this sign is coded 
by the operator \top. 


The operation of matrix transposing can be understood as the 
mirror reflection with respect to the main diagonal of a matrix 

Taking into account the notations (29.1), (29.7), and the de- 
finition 29.4, we can write the matrix formula (29.9) as follows: 


(a,b) =a'-G-b. (29.10) 


In the right hand side of the formula (29.10) the vectors a 
and b are presented by their coordinate columns, while the 
transformation of one of them into a row is written through the 
matrix transposing. 


EXERCISE 29.1. Show that for an arbitrary rectangular matrix 
A the equality (A‘')" = A is fulfilled. 


EXERCISE 29.2. Show that for the product of two matrices A 
and B the equality (A - B)" = B"- A” is fulfilled. 
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EXERCISE 29.3. Define the Gram matrices for bases on a line 
and for bases on a plane. Write analogs of the formulas (29.8), 
(29.9), and (29.10) for the scalar product of vectors lying on a line 
and on a plane. 


§ 30. Symmetry of the Gram matrix. 


DEFINITION 30.1. A square matrix A is called symmetric, if 
it is preserved under transposing, i.e. if the following equality is 
fulfilled: AT = A. 


Gram matrices possesses many important properties. One of 
these properties is their symmetry. 


THEOREM 30.1. The Gram matrix G of any basis e1, e2, €3 
in the space E is symmetric. 


PROOF. According to the definition 30.1 the symmetry of G 
is expressed by the formula G' = G. According to the defini- 
tion 29.4, the equality G’™ = G is equivalent to the relationship 


Jij = Iji (30.1) 


for the components of the matrix G. As for the relationship 
(30.1), upon applying (29.6), it reduces to the equality 


(ei ej) = (ej, ei) 


which is fulfilled due to the symmetry of the scalar product (see 
Theorem 28.1 and Definition 28.1). 


Note that the coordinate columns of the vectors a and b 
enter the right hand side of the formula (29.10) in somewhat 
unequal way — one of them is transposed, the other is not 
transposed. The symmetry of the matrix G eliminates this 
difference. Redesignating the indices 7 and j in the double sum 
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(29.8) and taking into account the relationship (30.1) for the 
components of the Gram matrix, we get 


3 3 


3 3 


j=1 i=1 i=1 j=1 
In the matrix form the formula (30.2) is written as follows: 
(a,b) =b'-G-a. (30.3) 


The formula (30.3) is analogous to the formula (29.10), but in 
this formula the coordinate column of the vector b is transposed, 
while the coordinate column of the vector a is not transposed. 


EXERCISE 30.1. Formulate and prove a theorem analogous to 
the theorem 30.1 for bases on a plane. Is it necessary to formulate 
such a theorem for bases on a line. 


§ 31. Orthonormal basis. 


DEFINITION 31.1. A basis on a straight line consisting of a 
nonzero vector e is called an orthonormal basis, if e is a unit 
vector, i.e. if Je] = 1. 


DEFINITION 31.2. A basis on a plane, consisting of two non- 
collinear vectors e1, €2, is called an orthonormal basis, if the 
vectors e; and ez are two vectors of the unit lengths perpendicu- 
lar to each other. 


DEFINITION 31.3. A basis in the space E consisting of three 
non-coplanar vectors e;, €2, e3 is called an orthonormal basis 
if the vectors e1, e2, e3 are three vectors of the unit lengths 
perpendicular to each other. 


In order to denote an orthonormal basis in each of the there 
cases listed above we use the abbreviation ONB. According to 


§32. ... OF AN ORTHONORMAL BASIS. 81 


the definition 29.1 the orthonormal basis is not opposed to a 
skew-angular basis SAB, it is a special case of such a basis. 

Note that the unit lengths of the basis vectors of an orthonor- 
mal basis in the definitions 31.1, 31.2, and 31.2 mean that their 
lengths are not one centimeter, not one meter, not one kilometer, 
but the pure numeric unity. For this reason all geometric realiza- 
tions of such vectors are conditionally geometric (see §2). Like 
velocity vectors, acceleration vectors, and many other physical 
quantities, basis vectors of an orthonormal basis can be drawn 
only upon choosing some scaling factor. Such a factor in this 
particular case is needed for to transform the numeric unity into 
a unit of length. 


§ 32. Gram matrix of an orthonormal basis. 


Let e1, e2, €3 be some orthonormal basis in the space E. 
According to the definition 31.3 the vectors e1, e2, e3 satisfy the 
following relationships: 


[e| =l, |e2| =1, |e3| =1, 
(32.1) 
e1 Le, e2 L es, e3 L e. 


Applying (32.1) and (29.6), we find the components of the Gram 
matrix for the orthonormal basis e;, e2, e3: 


1 for i=j; 
Jij = { : : (32.2) 
0 for ify. 


From (32.2) we immediately derive the following theorem. 


THEOREM 32.1. The Gram matrix (29.7) of any orthonormal 
basis is a unit matrix: 


100 
G=|/0 1 oļ=1. (32.3) 
001 
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Let’s recall that the components of a unit matrix constitute 
a numeric array 6 which is called the Kronecker symbol (see 
Definition 23.1). Therefore, taking into account (23.3), the 
equality (32.2) can be written as: 


Jij = 9ij- (32.4) 


The Kronecker symbol in (32.4) inherits the lower position of 
indices from gij. Therefore it is different from the Kronecker 
symbol in (23.4). Despite being absolutely identical, the compo- 
nents of the unit matrix in (32.3) and in (23.1) are of absolutely 
different nature. Having negotiated to use indices on upper and 
lower levels (see Definition 20.1), now we are able to reflect this 
difference in denoting these components. 


§ 33. Calculation of the scalar product through the 
coordinates of vectors in an orthonormal basis. 


According to the definition 29.1 the term skew-angular basis 
is used as a synonym of an arbitrary basis. For this reason an 
orthonormal basis is a special case of a skew-angular basis and 
we can use the formula (29.8), taking into account (32.4): 


3 3 
(a,b) = $0 J dV bi. (33.1) 


i=1 j=1 


In calculating the sum over j in (33.1), it is the inner sum here, 
the index 7 runs over three values and only for one of these three 
values, where j = 7, the Kronecker symbol 6;; is nonzero. For 
this reason wee can retain only one summand of the inner sum 
over j in (33.1), omitting other two summands: 


3 
(a,b) = Soa’ b! bj. (33.2) 
i=l 
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We know that 6;; = 1. Therefore the formula (33.2) turns to 


3 
(a b)= > ah (33.3) 


DEFINITION 33.1. The formula (33.3) is called the formula for 
calculating the scalar product through the coordinates of vectors in 
an orthonormal basis. 


Note that the sums in the formula (29.8) satisfy the index 
setting rule from the definition 24.8, while the sum in the formula 
(33.3) breaks this rule. In this formula the summation index has 
two entries and both of them are in the upper positions. This 
is a peculiarity of an orthonormal basis. It is more symmetric 
as compared to a general skew-angular basis and this symmetry 
hides some rules that reveal in a general non-symmetric bases. 

The formula (33.3) has the following matrix form: 


(a,b) =|ja! a? a? || - |b? (33.4) 
Taking into account the notations (29.1) and taking into account 
the definition 29.4, the formula (33.4) can be abbreviated to 


(a,b) =a'-b. (33.5) 


The formula (33.4) can be derived from the formula (29.9), while 
the formula (33.5) can be derived from the formula (29.10). 


§ 34. Right and left triples of 
vectors. The concept of orientation. 


DEFINITION 34.1. An ordered triple of vectors is a list of three 
vectors for which the order of listing vectors is fixed. 
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DEFINITION 34.2. An ordered triple of non-coplanar vectors 
aı, a2, ag is called a right triple if, when observing from the end 
of the third vector, the shortest 
rotation from the first vector to- 
ward the second vector is seen 
as a counterclockwise rotation. 


In the definition 34.2 we im- 
plicitly assume that the geomet- 
ric realizations of the vectors aj, 
a2, a3 with some common ini- 
tial point are considered as it is 
Fig. 34.1 shown in Fig. 34.1. 


DEFINITION 34.3. An ordered triple of non-coplanar vectors 
A 1, a2, ag is called a left triple if, when observing from the end 
of the third vector, the shortest rotation from the first vector 
toward the second vector is seen as a clockwise rotation. 


A given rotation about a given axis when observing from a 
given position could be either a clockwise rotation or a coun- 
terclockwise rotation. No other options are available. For this 
reason each ordered triple of non-coplanar vectors is either left or 
right. No other triples sorted by this criterion are available. 


DEFINITION 34.4. The property of ordered triples of non- 
coplanar vectors to be left or right is called their orientation. 


§35. Vector product. 


Let a and b be two non-collinear free vectors. Let’s lay their 
geometric realizations a = OA and b = OB at some arbitrary 
point O. In this case the vectors a and b define a plane AOB 
and lie on this plane. The angle y between the vectors a and 
b is determined according to Fig. 26.1. Due to a }{ b this angle 
ranges in the interval 0 < p < m and hence sin y Æ 0. 

Let’s draw a line through the point O perpendicular to the 
plane AOB and denote this line through c. The line c is 
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—> —> 
perpendicular to the vectors a = OA and b = OB: 


cla, 


clb. 


(35.1) 


It is clear that the conditions 

(35.1) fix a unique line c passing 

through the point O (see Theo- 

rems 1.1 and 1.3 in Chapter IV 
of the book [6]). 

There are two directions on 

Fig. 35.1 the line c. In Fig35.1 they are 

given by the vectors c and č. 

The vectors a, b, c constitute a right triple, while a, b, č is a left 

triple. So, specifying the orientation chooses one of two possible 

directions on the line c. 


DEFINITION 35.1. The vector product of two non-collinear 
vectors a and b is a vector c = |a, b] which is determined by the 
following three conditions: 


1) ¢ La and cL b; 

2) the vectors a, b, c form a right triple; 

3) |e] = [al |b| sing. 
In the case of collinear vectors a and b their vector product [a, b] 
is taken to be zero by definition. 


A comma is the multiplication sign in the writing the vec- 
tor product, not by itself, but together with square brackets 
surrounding the whole expression. These brackets are natural de- 
limiters for multiplicands: the first multiplicand is an expression 
between the opening bracket and the comma, while the second 
multiplicand is an expression between the comma and the clos- 
ing bracket. Therefore in complicated expressions no auxiliary 
delimiters are required. For example, in the formula 


[a+b,c+d] 
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the sums a + b and c +d are calculated first, then the vector 
multiplication is performed. 

A remark. Often the vector product is written as a x b. 
Even the special term «cross product» is used. However, to my 
mind, this notation is not good. It is misleading since the cross 
sign is sometimes used for denoting the product of numbers when 
a large formula is split into several lines. 

A remark. The physical nature of the vector product |a, b] 
often differs from the nature of its multiplicands a and b. Even if 
the lengths of the vectors a and b are measured in length units, 
the length of their product fa, b] is measured in units of area. 


EXERCISE 35.1. Show that the vector product c = [a,b] of 
two free vectors a and b is a free vector and, being a free vector, 
it does not depend on where the point O in Fig. 35.1 is placed. 


§ 36. Orthogonal projection onto a plane. 


Let a # 0 be some nonzero free vector. According to the 
theorem 27.1, each free vector b has the expansion 


relative to the vector a, where the vector b, is collinear to the 
vector a, while the vector b, is perpendicular to the vector a. 
Recall that through ma we denoted a mapping that associates 
each vector b with its component b, in the expansion (36.1). 
Such a mapping was called the orthogonal projection onto the 
direction of the vector a £ 0 (see Definition 27.3). 


DEFINITION 36.1. The mapping Ta that associates each free 
vector b with its perpendicular component b, in the expansion 
(36.1) is called the orthogonal projection onto a plane perpendicu- 
lar to the vector a 4 0 or, more exactly, the orthogonal projection 
onto the orthogonal complement of the vector a # 0. 


DEFINITION 36.2. The orthogonal complement of a free vector 
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a is the collection of all free vectors x perpendicular to a: 


a= (x: xla}. (36.2) 


The orthogonal complement (36.2) of a nonzero vector a 4 0 
can be visualized as a plane if we choose one of its geometric 
A realizations a = OA’ Indeed, 
a let’s lay various vectors perpen- 
dicular to a at the point O. The 
ending points of such vectors fill 
the plane a shown in Fig. 36.1. 
The properties of the orthog- 
onal projections onto a line ma 
from the definition 27.1 and the 
orthogonal projections onto a 
plane mia from the definition 
36.1 are very similar. Indeed, we have the following theorem. 


Fig. 36.1 


THEOREM 36.1. For any nonzero vector a Æ 0 the orthogonal 
projection Tia onto a plane perpendicular to the vector a is a 
linear mapping. 


PROOF. In order to prove the theorem 36.1 we write the 
relationship (36.1) as follows: 
b = ma (b) + ma(b). (36.3) 


The relationship (36.3) is an identity, it is fulfilled for any vector 
b. First we replace the vector b by b+ c in (36.3), then we 
replace b by ab in (36.3). As a result we get two relationships 


Ma(b+c)=b+c—7,(b+c), (36.4) 
Ta(ab) =qab -— Talab). (36.5) 


Due to the theorem 27.3 the mapping 7, is a linear map- 
ping. For this reason the relationships (36.4) and (36.5) can be 
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transformed into the following two relationships: 
Ta(b+c) = b — malb) +c — malc), (36.6) 
Tialab) = a(b — 7(b)). (36.7) 


The rest is to apply the identity (36.3) to the relationships (36.6) 
and (36.7). As a result we get 


Ta(b +c) = Talb) + Talc), (36.8) 
Tala b) = & Talb). (36.9) 
The relationships (36.8) and (36.9) are exactly the linearity 


conditions from the definition 27.4 written for the mapping Tia. 
The theorem 36.1 is proved. 


$837. Rotation about an axis. 


Let a 4 0 be some nonzero free vector and let b be some arbi- 
trary free vector. Let’s lay the vector b = BO at some arbitrary 
point B. Then we lay the vector a = OA at the terminal point of 
the vector BO. The vector a = Ov is nonzero. For this reason 
it defines a line OA. We take this line for the rotation axis. Let’s 
denote through 0% the rotation of the space E about the axis OA 

by the angle » (see Fig. 37.1). 


The vector a = OA fixes one 


A of two directions on the rotation 
a axis. At the same time this vec- 
O tor fixes the positive direction of 


rotation about the axis OA. 


DEFINITION 37.1. The rota- 
tion about an axis OA with with 
fixed direction a = OA on it 
is called a positive rotation if, 
being observed from the termi- 
Fig. 37.1 nal point of the vector OA’. ie. 
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when looking from the point A toward the point O, it occurs in 
the counterclockwise direction. 


Taking into account the definition 37.1, we can consider the 
rotation angle y as a signed quantity. If p > 0, the rotation 0% 
occurs in the positive direction with respect to the vector a, if 
p < 0, it occurs in the negative direction. 

: ; A ae 

Let’s apply the rotation mapping 0% to the vectors a = OA 
and b = BO in Fig. 37.1. The points A and O are on the 
rotation axis. For this reason under the rotation 6% the points 
A and O stay at their places and the vector a = OA does not 
change. As for the vector b = BO, it is mapped onto another 

tor BO. N lyi rallel translations, we can replicate 
vector Now, applying parallel t a p 
the vector BO up to a free vector b = BO (see Definitions 4.1 


and 4.2). The vector b is said to be produced from the vector b 
by applying the mapping 0f and is written as 


b = 0£ (b). (37.1) 


LEMMA 37.1. The free vector b = 0%(b) in (37.1) produced 
from a free vector b by means of the rotation mapping 0 does 
not depend on the choice of a geometric realization of the vector 
a defining the rotation axis and on a geometric realization of the 
vector b itself. 


DEFINITION 37.2. The mapping 0% acting upon free vectors 
of the space E and taking them to other free vectors in E is called 
the rotation by the angle y about the vector a. 


EXERCISE 37.1. Rotations and parallel translations belong to 
the class of mappings preserving lengths of segments and measures 
of angles. They take each segment to a congruent segment and 
each angle to a congruent angle (see [6]). Let p be some parallel 
translation, let p~! be its inverse parallel translation, and let 0 
be a rotation by some angle about some axis. Prove that the 
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composite mapping 6 = poop! is the rotation by the same 
angle about the axis produced from the axis of 0 by applying the 
parallel translation p to it. 


EXERCISE 37.2. Apply the result of the exercise 37.1 for prov- 
ing the lemma 37.1. 


THEOREM 37.1. For any nonzero free vector a # 0 and for 
any angle y the rotation 0£ by the angle p about the vector a is 
a linear mapping of free vectors. 


PROOF. In order to prove the theorem we need to inspect the 
conditions 1) and 2) from the definition 27.4 for the mapping 02. 
Let’s begin with the first of these conditions. Assume that b and 
c are two free vectors. Let’s build their geometric realizations 
b= BC and c = CO. . Then the vector BO is the geometric 
realization for the sum of vectors b + c. 

Now let’s choose a geometric realization for the vector a. It 
determines the rotation axis. According to the lemma 37.1 the 
actual place of such a geometric realization does not matter for 
the ultimate definition of the mapping 0% as applied to free 

i ar 
vectors. But for the sake of certainty we choose a= OA. 

Let’s apply the rotation by the angle y about the axis OA to 
the points B, C, and O. The point O is on the rotation axis. 
Therefore it is not moved. The points B and C are moved to the 
points B and C respectively, while the triangle BCO is moved to 
the triangle BCO. As a result we get the following relationships: 


= =a? 

9°(BO’) = BC, 
== =? 

92(CO’) =CO, (37.2) 
= => 

02( BO) = Bo. 


ema = 
But the vectors BC and CO in (37.2) are geometric realizations 


for the vectors b = 0%(b) and č = 0£ (c), while BO is a geometric 
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realization for the vector 0£(b +c). Hence we have 


z so = 
6%(b +c) = BO =BC +CO =62%(b) +6%(c). (37.3) 


The chain of equalities (37.3) proves the first linearity condition 
from the definition 27.4 as applied to 6%. 

Let’s proceed to proving the second linearity condition. It is 
more simple than the first one. Multiplying a vector b by a 
number a, we make the length of its geometric realizations |a| 
times as greater. If a > 0, geometric realizations of the vector ab 
are codirected to geometric realizations of b. If a < 0, they are 
opposite to geometric realizations of b. And if a = 0, geometric 
realizations of the vector ab do vanish. Let’s apply the rotation 
by the angle y about some geometric realization of the vector 
a #0 some geometric realizations of the vectors b and ab. Such 
a mapping preserves the lengths of vectors. Hence it preserves all 
the relations of their lengths. Moreover it maps straight lines to 
straight lines and preserves the order of points on that straight 
lines. Hence codirected vectors are mapped to codirected ones 
and opposite vectors to opposite ones respectively. As a result 


02? (ab) = a6%(b). (37.4) 


The relationship (37.4) completes the proof of the linearity for 
the mapping 6% as applied to free vectors. 


§ 38. The relation of the vector 
product with projections and rotations. 


Let’s consider two non-collinear vectors a }{ b and their vector 
product c = [a,b]. The length of the vector c is determined by 
the lengths of a and b and by the angle y between them: 


lc| = |a| |b] sin y. (38.1) 
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The vector c lies on the plane 
@ perpendicular to the vector 
b (see Fig. 38.1). Let’s denote 
through a, the orthogonal pro- 
jection of the vector a onto the 
plane a, i e. we set 


a, =i (a). (38.2) 


The length of the above vec- 

Fig. 38.1 tor (38.2) is determined by the 

formula ja,_| = |a| siny. Com- 

paring this formula with (38.1) and taking into account Fig. 38.1, 

we conclude that in order to superpose the vector a, with the 

vector c one should first rotate it counterclockwise by the right 

angle about the vector b and then multiply by the negative 
number —|b]|. This yields the formula 


[a, b] = -|b| - 66’? (mia (a)). (38.3) 


The formula (38.3) sets the relation of the vector product with 
the two mappings 7p and ae *. One of them is the projection 
onto the orthogonal complement of the vector b, while the other 
is the rotation by the angle 7/2 about the vector b. The formula 
(38.3) is applicable provided the vector b is nonzero: 


b £0, (38.4) 


while the condition a } b can be broken. If a || b both sides of 
the formula (38.4) vanish, but the formula itself remains valid. 


§ 39. Properties of the vector product. 


THEOREM 39.1. The vector product of vectors possesses the 
following four properties fulfilled for any three vectors a, b, c and 
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for any number a: 
1) [a,b] = —[b, al; 
2) [a +b, c] = [a,c] + [b,c]; 
3) [aa,c] = a |a, c]; 
4) [a,b] = 0 if and only if the vectors a and b 
are collinear, i.e. if a || b. 


DEFINITION 39.1. The property 1) in the theorem 39.1 is 
called anticommutativity; the properties 2) and 3) are called the 
properties of linearity with respect to the first multiplicand; the 
property 4) is called the vanishing condition. 


PROOF OF THE THEOREM 39.1. The property of anticommu- 
tativity 1) is derived immediately from the definition 35.1. Let 
a }{ b. Exchanging the vectors a and b, we do not violate the 
first and the third conditions for the triple of vectors a, b, c in 
the definition 35.1, provided they are initially fulfilled. As for 
the direction of rotation in Fig. 35.1, it changes for the opposite 
one. Therefore, if the triple a, b, c is right, the triple b, a, c is 
left. In order to get a right triple the vectors b and a should be 
complemented with the vector —c. This yield the equality 


[a, b] = -[b,a] (39.1) 


for the case a } b. If a || b, both sides of the equality (39.1) do 
vanish. So the equality remains valid in this case too. 

Let c # 0. The properties of linearity 1) and 2) for this 
case are derived with the use of the formula (38.3) and the 
theorems 36.1 and 37.1. Let’s write the formula (38.3) as 


[a,c] = —|e| - 02’? (me(a)). (39.2) 


Then we change the vector a in (39.2) for the sum of vectors 
a+b and apply the theorems 36.1 and 37.1. This yields 


[a + b,c] = —|e| - 09° (mc(a + b)) = —|e|- 
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ae (mc(a) + T¢(b)) = —|e] - 0g”? (TLe(a)) a 
— |c| - 07? (m,¢(b)) = [a,c] + [b, c]. 


Now we change the vector a in (39.2) for the product aa and 
then apply the theorems 36.1 and 37.1 again: 


[a a, c] = —|c| ` Oo (Tiela a)) = —|c| g gee (a T™¢(a)) = 


= -a|c| - 02/? (m@(a)) =a [a,c]. 


The calculations which are performed above prove the equali- 
ties 2) and 3) in the theorem 39.1 for the case c # 0. If c = 0, 
both sides of these equalities do vanish and they appear to be 
trivially fulfilled. 

Let’s proceed to proving the fourth item in the theorem 39.1. 
For a || b the vector product [a, b] vanishes by the definition 35.1. 
Let a ¥ b. In this case both vectors a and b are nonzero, while 
the angle y between them differs from 0 and m. For this reason 
siny #0. Summarizing these restrictions and applying the item 
3) of the definition 35.1, we get 


|[a, b]| = |a| |b] siny 4 0, 


i.e. for a } b the vector product [a,b] cannot vanish. The proof 
of the theorem 39.1 is over. 


THEOREM 39.2. Apart from the properties 1)—4), the vector 
product possesses the following two properties which are fulfilled 
for any vectors a, b, c and for any number a: 

5) [c,a + b] =[c,a] + [a,b]; 
6) [c,aa] = a [c, a]. 


DEFINITION 39.2. The properties 5) and 6) in the theo- 
rem 39.2 are called the properties of linearity with respect to 
the second multiplicand. 
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The properties 5) and 6) are easily derived from the properties 
2) and 3) by applying the property 1). Indeed, we have 


[c,a+ b] = —[a+ b,c] = —[a, c] — [b,c] = [c, a] + [c, b], 


[c, aa] = -|a a,c] = —a [a,c] = a |c, a]. 
These calculations prove the theorem 39.2. 


§ 40. Structural constants of the vector product. 


Let e1, e2, e3 be some arbitrary basis in the space E. Let’s 
take two vectors e; and e; of this basis and consider their vector 
product |e;,e;]. The vector [e;,e;] can be expanded in the basis 
€1, €2, €3. Such an expansion is usually written as follows: 


ley, €;] = Ci e, + CZ, €o +C} e3. (40.1) 


The expansion (40.1) contains three coefficients C}, Ci, and 
CE. However, the indices 7 and j in it run independently over 
three values 1, 2, 3. For this reason, actually, the formula (40.1) 
represent nine expansions, the total number of coefficients in it is 
equal to twenty seven. 


The formula (40.1) can be abbreviated in the following way: 


3 
[ei,e;] = > Ch er. (40.2) 
k=1 


Let’s apply the theorem 19.1 on the uniqueness of the expansion 
of a vector in a basis to the expansions of [e;,e;] in (40.1) or in 
(40.2). As a result we can formulate the following theorem. 


THEOREM 40.1. Each basis e1, e2, e3 in the space E is asso- 
ciated with a collection of twenty seven constants Ck which are 
determined uniquely by this basis through the expansions (40.2). 


DEFINITION 40.1. The constants Ch, which are uniquely de- 
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termined by a basis e1, e2, e3 through the expansions (40.2), are 
called the structural constants of the vector product in this basis. 


The structural constants of the vector product are similar to 
the components of the Gram matrix for a basis e1, e2, e3 (see 
Definition 29.2). But they are more numerous and form a three 
index array with two lower indices and one upper index. For this 
reason they cannot be placed into a matrix. 


§41. Calculation of the vector product through the 
coordinates of vectors in a skew-angular basis. 


Let e1, e2, e3 be some skew-angular basis. According to the 
definition 29.1 the term skew-angular basis in this book is used 
as a synonym of an arbitrary basis. Let’s choose some arbitrary 
vectors a and b in the space E and consider their expansions 


3 3 
a=) de; b=) bej (41.1) 
i=1 j=l 


in the basis e1, e2, e3. Substituting (41.1) into the vector product 
la, b], we get the following formula: 


b] = [doe ei, D e|: (41.2) 


In order to transform the formula (41.2) we apply the properties 
2) and 5) of the vector product (see Theorems 39.1 and 39.2). 
Due to these properties we can bring the summation signs over i 
and j outside the brackets of the vector product: 


3 3 
[a, b] [=> > e e;, bl e;]. (41.3) 


a=1 j=1 


Now let’s apply the properties 3) and 6) from the theorems 39.1 
and 39.2. Due to these properties we can bring the numeric 
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factors af and b/ outside the brackets of the vector product (41.3): 


3 3 
b] = aa Í [e;,e;]. (41.4) 


The vector products [e;,e;] in the formula (41.4) can be 
replaced by their expansions (40.2). Upon substituting (40.2) 
into (41.4) the formula (41.4) is written as follows: 


3 3 3 
fa, b] =} a CE eg. (41.5) 


DEFINITION 41.1. The formula (41.5) is called the formula for 
calculating the vector product through the coordinates of vectors in 
a skew-angular basis. 


§ 42. Structural constants of the vector 
product in an orthonormal basis. 


+ 


Let’s recall that an orthonormal basis (ONB) in the space E 
is a basis composed by three unit vectors perpendicular to each 
other (see Definition 31.3). By their orientation, triples of non- 
coplanar vectors in the space E are subdivided into right and left 
triples (see Definition 34.4). Therefore all bases in the space E 
are subdivided into right bases 
and left bases, which applies to 
orthonormal bases as well. 

Let’s consider some right or- 
thonormal basis e1, e2, e3. It is 
shown in Fig. 42.1. Using the 
definition 35.1, one can calculate 
various pairwise vector products 
of the vectors composing this ba- 
sis. Since the geometry of a right 


€3 


Fig. 42.1 
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ONB is rather simple, we can perform these calculations up to an 
explicit result and compose the multiplication table for e1, e2, e3: 


[e1,e1] = 0, [e1, e2] = €3, [e1,e3] = —e2, 
[e2,e1] =—e3, [e2,e2] = 0, [e2,e3] = e1, (42.1) 
[e3,e1] = e2, [e3,€2]} =—e1,  [e3,e3] = 0. 


Let’s choose the first of the relationships (42.1) and write its 
right hand side in the form of an expansion in the basis e1, e2, e3: 


[e1,€1] = 0eı + Veo + 063. (42.2) 


Let’s compare the expansion (42.2) with the expansion (40.1) 
written for the case i = 1 and j = 1: 


[ex, e1] = Ch e1 + Ce e2 + Ch e3. (42.3) 


Due to the uniqueness of the expansion of a vector in a basis (see 
Theorem 19.1) from (42.2) and (42.3) we derive 


Cira 0; Caci; C} =0. (42.4) 


Now let’s choose the second relationship (42.1) and write its 
right hand side in the form of an expansion in the basis e1, e2, e3: 


[e1, e2] = Oe, + Oe + les. (42.5) 


Comparing (42.5) with the expansion (40.1) written for the case 
i = 1 and j = 2, we get the values of the following constants: 


Che = 0, Cc. = 0, Ci = I; (42.6) 


Repeating this procedure for all relationships (42.1), we can get 
the complete set of relationships similar to (42.4) and (42.6). 
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Then we can organize them into a single list: 


Ch =0, Ci, = 0, Ci, = 0, 

Cih =0, Ch =0, Ch =I, 

Cis =0, Cis =el, Cis = 0, 

Cy = 0, Cy, = 0, Cy, =-l, 

C3 = 0, C> =0, c3 =0, (42.7) 
C33 =1, C33 =0, C33 =0, 

C= 0, C1, Co 0, 

C32 =-]1, C32 =0, C32 =1, 

Cł =0, C$; = 0, CÌ = 0. 


The formulas (42.7) determine all of the 27 structural constants 
of the vector product in a right orthonormal basis. Let’s write 
this result as a theorem. 


THEOREM 42.1. For any right orthonormal basis e1, e2, e3 in 
the space E the structural constants of the vector product are 
determined by the formulas (42.7). 


THEOREM 42.2. For any left orthonormal basis e1, e2, e3 in 
the space E the structural constants of the vector product are 
derived from (42.7) by changing signs «+>» for «—» and vise versa. 


EXERCISE 42.1. Draw a left orthonormal basis and, applying 
the definition 35.1 to the pairwise vector products of the basis 
vectors, derive the relationships analogous to (42.1). Then prove 
the theorem 42.2. 


§ 43. Levi-Civita symbol. 


Let’s examine the formulas (42.7) for the structural constants 
of the vector product in a right orthonormal basis. One can 
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easily observe the following pattern in them: 


Ch = 0 if there are coinciding values (43.1) 
of the indices i, j, k. 


The condition (43.1) describes all of the cases where the struc- 
tural constants in (42.7) do vanish. The cases where Ck = 1 are 
described by the condition 


Ck = 1 if the indices i,j,k take the values (43.2) 

1, 2,3), (2,3, 1), or (3, 1,2). 
Finally, the cases where Ck = —1 are described by the condition 
ck = —1 if the indices i,j,k take the values (43.3) 


(1,3,2), (3,2,1), or (2,1,3). 


The triples of numbers in (43.2) and (43.3) constitute the 
complete set of various permutations of the numbers 1, 2, 3: 


(43.4) 


The first three permutations in (43.4) are called even permuta- 
tions. They are produced from the right order of the numbers 
1, 2, 3 by applying an even number of pairwise transpositions to 
them. Indeed, we have 


(1,2,3); 
(1,2,3) = (2,1,3) > (2,3,1); 
EE (82) e 


The rest three permutations in (43.4) are called odd permutations. 
In the case of these three permutations we have 


(1,2,3) —> (1,3, 2); 
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(1; 2, 3) =n (3, 2, 1); 


023 —> @,1, 8). 


DEFINITION 43.1. The permutations (43.4) constitute a set, 
which is usually denoted through $3. If o € S3, then (—1)” 
means the parity of the permutation oc: 


1 if the permutation ø is even; 
(=1) = 


—1 if the permutation ø is odd. 


Zeros, unities, and minus unities from (43.1), (43.2), and (43.3) 
are usually united into a single numeric array: 


0 if there are coinciding values 
of the indices i, j, k; 


1 if the values of the indices i, j, k 
ijk _ = form an even permutation of (43.5) 
the numbers 1, 2, 3; f 


—1 ifthe values of the indices i, j, k 
form an odd permutation of the 
numbers 1, 2,3. 


DEFINITION 43.2. The numeric array € determined by the 
formula (43.5) is called the Levi-Civita symbol. 


When writing the components of the Levi-Civita symbol either 
three upper indices or three lower indices are used. Thus we 
emphasize the equity of all these three indices. Placing indices 
on different levels in the Levi-Civita symbol is not welcome. 
Summarizing what was said above, the formulas (43.1), (43.2), 
and (43.3) are written as follows: 


Ck = Eijk. (43.6) 


THEOREM 43.1. For any right orthonormal basis e1, e2, e3 the 


102 CHAPTER I. VECTOR ALGEBRA. 


structural constants of the vector product in such a basis are de- 
termined by the equality (43.6). 


In the case of a left orthonormal basis we have the theo- 
rem 42.2. It yields the equality 


Ch = Ejk (43.7) 


THEOREM 43.2. For any left orthonormal basis e1, e2, e3 the 
structural constants of the vector product in such a basis are de- 
termined by the equality (43.7). 


Note that the equalities (43.6) and (43.7) violate the index 
setting rule given in the definition 24.9. The matter is that the 
structural constants of the vector product Ck are the components 
of a geometric object. The places of their indices are determined 
by the index setting convention, which is known as Einstein’s 
tensorial notation (see Definition 20.1). As for the Levi-Civita 
symbol, it is an array of purely algebraic origin. 

The most important property of the Levi-Civita symbol is 
its complete skew symmetry or complete antisymmetry. It is 
expressed by the following equalities: 


Eijk = —Ejik, Eijk = —Eikj» Eijk = —Ekjis (43.8) 
gidk — —gjik, 
The equalities (43.8) mean, that under the transposition of any 
two indices the quantity ¢;;, = £*"" changes its sign. These 


equalities are easily derived from (43.5). 


§44. Calculation of the vector product through the 
coordinates of vectors in an orthonormal basis. 


Let’s recall that the term skew-angular basis in this book is 
used as a synonym of an arbitrary basis (see Definition 29.1). Let 
€1, G2, €3 be a right orthonormal basis. It can be treated as a 
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special case of a skew-angular basis. Substituting (43.6) into the 
formula (41.5), we obtain the formula 


a,b] = Š 


i=1 j 


3 3 
ya bÍ Eijk €k- (44.1) 
=1 k=1 
Here at and b/ are the coordinates of the vectors a and b in the 
basis €1, €2, €3. 

In order to simplify the formulas (44.1) note that the majority 
components of the Levi-Civita symbol are equal to zero. Only 
six of its twenty seven components are nonzero. Applying the 
formula (43.5), we can bring the formula (44.1) to 


[a, b] = a! b? e3 + a? b’? e1 +a? b! eo — 


44.2 
— a? b! e3 — a? b? e1 — a! b’ ev. l ) 
Upon collecting similar terms the formula (44.2) yields 
a,b] = e4 (a? b’ — a3 b?) — 
pees Oo 
— e2 (a b? — a? b) + 3 (a bf — af b`). 
Now from the formula (44.3) we derive 
a a? al a’ al a? 
[a, b] = e1 po p — e2 a + e3 a al (44.4) 


In deriving the formula (44.4) we used the formula for the 
determinant of a 2 x 2 matrix (see [7]). 

Note that the right hand side of the formula (44.4) is the 
expansion of the determinant of a 3 x 3 matrix by its first row 
(see [7]). Therefore this formula can be written as: 


e1 €2 €&3 
[a,b] = |a! a? aè). (44.5) 
bi b b 
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Here a', a?, a? and bt, b?, b’? are the coordinates of the vectors a 


and b in the basis e1, e2, e3. They fill the second and the third 
rows in the determinant (44.5). 


DEFINITION 44.1. The formulas (44.1) and (44.5) are called 
the formulas for calculating the vector product through the coordi- 
nates of vectors in a right orthonormal basis. 


Let’s proceed to the case of a left orthonormal basis. In this 
case the structural constants of the vector product are given by 
the formula (43.7). Substituting (43.7) into (41.5), we get 


3 3 3 
>D bI Eijk €k- (44.6) 


i=1 j=1 k=1 


Then from (44.6) we derive the formula 


ey; €2 €&3 
[a,b] =—J]a! a? a}. (44.7) 
bt b2 b3 


DEFINITION 44.2. The formulas (44.6) and (44.7) are called 
the formulas for calculating the vector product through the coordi- 
nates of vectors in a left orthonormal basis. 


§45. Mixed product. 


DEFINITION 45.1. The mixed product of three free vectors a, 
b, and c is a number obtained as the scalar product of the vector 
a by the vector product of b and c: 


(a, b,c) = (a, [b, c}). (45.1) 


As we see in the formula (45.1), the mixed product has 
three multiplicands. They are separated from each other by 
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commas. Commas are the multiplication signs in writing the 
mixed product, not by themselves, but together with the round 
brackets surrounding the whole expression. 

Commas and brackets in writing the mixed product are nat- 
ural delimiters for multiplicands: the first multiplicand is an 
expression between the opening bracket and the first comma, the 
second multiplicand is an expression placed between two com- 
mas, and the third multiplicand is an expression between the 
second comma and the closing bracket. Therefore in complicated 
expressions no auxiliary delimiters are required. Foe example, in 


(a+b,c+d,e+f) 


the sums a+ b, c+ d, and e + f are calculated first, then the 
mixed product itself is calculated. 


§ 46. Calculation of the mixed product through the 
coordinates of vectors in an orthonormal basis. 


The formula (45.1) reduces the calculation of the mixed prod- 
uct to successive calculations of the vectorial and scalar products. 
In the case of the vectorial and scalar products we already have 
rather efficient formulas for calculating them through the coordi- 
nates of vectors in an orthonormal basis. 

Let e1, e2, e3 be a right orthonormal basis and let a, b, and c 
be free vectors given by their coordinates in this basis: 


a! b! c! 
asa? || b=? il, eas e (46.1) 
a? b3 ro 


Let’s denote d = [b,c]. Then the formula (45.1) is written as 
(a,b,c) = (a,d). (46.2) 


In order to calculate the vector d = [b,c] we apply the formula 
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(44.1) which now is written as follows: 


a=y(Sde c se) ek- (46.3) 


a=1 J=1 


The formula (46.3) is an expansion of the vector d in the ba- 
sis €1, €2, €3. Hence the coefficients in this expansion should 
coincide with the coordinates of the vector d: 


3 3 
= DDOD cI Eijk. (46.4) 
i=1 j=1 


The next step consists in using the coordinates of the vector d 
from (46.4) for calculating the scalar product in the right hand 
side of the formula (46.2). The formula (33.3) now is written as 


3 
d=) ad". (46.5) 
k=1 


Let’s substitute (46.4) into (46.5) and take into account (46.2). 
As a result we get the formula 


(a,b,c) = ye (Mean): (46.6) 


Expanding the right hand side of the formula (46.6) and changing 
the order of summations in it, we bring it to 


3 3 3 
(a,b,c) =X X X b c eijk aă. (46.7) 
i=1 j=1 k=1 


Note that the right hand side of the formula (46.7) differs from 
that of the formula (44.1) by changing a’ for bt, changing bf for 
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cî, and changing ex for a*. For this reason the formula (46.7) 
can be brought to the following form analogous to (44.5): 


a @ aè 
(a,b,c) =]bt 8 è]. (46.8) 
ae e 


Another way for transforming the formula (46.7) is the use 
of the complete antisymmetry of the Levi-Civita symbol (43.8). 
Applying this property, we derive the identity £ijk = Ekij- Due 
to this identity, upon changing the order of multiplicands and 
redesignating the summation indices in the right hand side of the 
formula (46.7), we can bring this formula to the following form: 


3 
(a, bye)= Soy at co em. (46.9) 


DEFINITION 46.1. The formulas (46.8) and (46.9) are called 
the formulas for calculating the mixed product through the coordi- 
nates of vectors in a right orthonormal basis. 


The coordinates of the vectors a, b, and c used in the formulas 
(46.8) and (46.9) are taken from (46.1). 

Let’s proceed to the case of a left orthonormal basis. Analogs 
of the formulas (46.8) and (46.9) for this case are obtained by 
changing the sign in the formulas (46.8) and (46.9): 


at a g 
(a,b,c) =-|b & 6 |. (46.10) 
i 2 


3 3 3 
(a,b,c) =— SOS Y ak b e cigs. (46.11) 
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DEFINITION 46.2. The formulas (46.10) and (46.11) are called 
the formulas for calculating the mixed product through the coordi- 
nates of vectors in a left orthonormal basis. 


The formulas (46.10) and (46.11) can be derived using the 
theorem 42.2 or comparing the formulas (43.6) and (43.7). 


847. Properties of the mixed product. 


THEOREM 47.1. The mixed product possesses the following 
four properties which are fulfilled for any four vectors a, b, c, d 
and for any number a: 

1) (a,b,c) = —(a, c, b), 

(a,b,c) = —(c,b, a), 
(a,b,c) = —(b, a, c); 

2) (a+ b,c, d) = (a, c,d) + (b,c, d); 

3) (aa,c,d) = a (a, c,d); 

4) (a,b,c) = 0 if and only if the vectors a, b, and c are coplanar. 


DEFINITION 47.1. The property 1) expressed by three equal- 
ities in the theorem 47.1 is called the property of complete skew 
symmetry or complete antisymmetry, the properties 2) and 3) are 
called the properties of linearity with respect to the first multipli- 
cand, the property 4) is called the vanishing condition. 


PROOF OF THE THEOREM 47.1. The first of the three equali- 
ties composing the property of complete antisymmetry 1) follows 
from the formula (45.1) and the theorems 39.1 and 28.1: 


(a, b,c) = (a, [b,c]) = (a, -[c, b]) = —(a, [c, b]) = — (a, c, b). 


The other two equalities entering the property 1) cannot be 
derived in this way. Therefore we need to use the formula (46.8). 
Transposition of two vectors in the left hand side of this formula 
corresponds to the transposition of two rows in the determinant 
in the right hand side of this formula. It is well known that the 
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transposition of any two rows in a determinant changes its sign. 
This observation proves all of the three equalities composing the 
property of complete antisymmetry for the mixed product. 

The properties of linearity 2) and 3) of the mixed product in 
the theorem 47.1 are derived from the corresponding properties 
of the scalar and vectorial products due to the formula (45.1): 


(a+b,c,d) = (a+b, [e,d]) = (a, [c, d]) + 
+ (b, [c, d]) = (a, c,d) + (b, c,d), 


(aa,c,d) = (aa, [c,d]) = a (a, [c,d]) = a (a, c, d). 


Let’s proceed to proving the fourth property of the mixed 
product in the theorem 47.1. Assume that the vectors a, b, and 
c are coplanar. In this case they are parallel to some plane a in 
the space E and one can choose their geometric realizations lying 
on this plane a. If b }/ c, then the vector product d = [b,c] is 
nonzero and perpendicular to the plane a. As for the vector a, it 
is parallel to this plane. Hence d | a, which yields the equalities 
(a,b,c) = (a, [b,c]) = (a,d) = 0. 

If b || c, then the vector product [b,c] is equal to zero and the 
equality (a,b,c) = 0 is derived from [b,c] = 0 with use of the 
initial formula (45.1) for the scalar product. 

Now, conversely, assume that (a,b,c) = 0. If b || c, then the 
vectors a, b, and c determine not more than two directions in 
the space E. For any two lines in this space always there is a 
plane to which these lines are parallel. In this case the vectors a, 
b, and c are coplanar regardless to the equality (a,b,c) = 0. 

If b } c, then d = [b,c] 4 0. Choosing geometric realizations 
of the vectors b and c with some common initial point O, 
we easily build a plane œ comprising both of these geometric 
realizations. The vector d 0 in perpendicular to this plane a. 
Then from (a,b,c) = (a,[b,c]) = (a,d) = 0 we derive a L d, 
which yields a || a. The vectors b and c are also parallel to 
the plane a since their geometric realizations lie on this plane. 
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Hence all of the three vectors a, b, and c are parallel to the 
plane a, which means that they are coplanar. The theorem 47.1 
is completely proved. 


THEOREM 47.2. Apart from the properties 1)—4), the mixed 
product possesses the following four properties which are fulfilled 
for any four vectors a, b, c, d and for any number a: 

(c, at b, d) E (c, a, d) ug (c, b, d); 

(c,aa,d) = a(c,a,d); 

(c,d,a + b) = (c,d,a) + (c,d, b); 
(c,d,aa) = a (c,d,a). 


6 
T 


DEFINITION 47.2. The properties 5) and 6) in the theorem 
47.2 are called the properties of linearity with respect to the second 
multiplicand, the properties 7) and 8) are called the properties of 
linearity with respect to the third multiplicand. 


The property 5) is derived from the property 2) in the theo- 
rem 47.1 in the following way: 


(c,a+b,d) = —(a+b,c,d) = —((a,c,d) + (b,c,d)) = 
= —(a,c,d) — (b,c,d) = (c,a,d) + (c,b,d). 


The property 1) from this theorem is also used in the above 
calculations. As for the properties 6), 7), and 8) in the theo- 
rem 47.2, they are also easily derived from the properties 2) and 
3) with the use of the property 1). Indeed, we have 


(c,aa,d) = —(aa,c,d) = —a(a,c,d) = a(c,a,d), 


(c,d,a + b) = —(a+b,d,c) = —((a,d,c) + (b,d,c)) = 
= —(a,d,c) — (b,d,c) = (c,d,a) + (c,d, b), 


(c,d,aa) = —(aa,d,c) = —a(a,d,c) = a (c, d,a). 


The calculations performed prove the theorem 47.2. 
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§ 48. The concept of the oriented volume. 


Let a, b,c be a right triple of non-coplanar vectors in the 
space E. Let’s consider their mixed product (a,b,c). Due to the 
item 4) from the theorem 47.1 
the non-coplanarity of the vec- 
tors a, b, c means (a,b,c) 4 0, 
which in turn due to (45.1) im- 
plies [b,c] 4 0. 

Due to the item 4) from the 
theorem 39.1 the non-vanishing 
condition [b,c] 4 0 means b }{ c. 
Let’s build the geometric realiza- 
tions of the non-collinear vectors 
b and c at some common ini- 
tial point O and denote them 
paor ade OC. Then we 
build the geometric realization of the vector a at the same initial 


Fig. 48.1 


point O and denote it through a = OA’ . Let’s complement the 
vectors OA, OB, and OC up to a skew-angular parallelepiped 
as shown in Fig. 48.1. 

Let’s denote d = [b,c]. The vector d is perpendicular to the 
base plane of the parallelepiped, its length is calculated by the 
formula |d| = |b| |c| sina. It is easy to see that the length of d 
coincides with the base area of our parallelepiped, i.e. with the 
area of the parallelogram built on the vectors b and c: 


S = |d| = [b] |e] sina. (48.1) 


THEOREM 48.1. For any two vectors the length of their vector 
product coincides with the area of the parallelogram built on these 
two vectors. 


The fact formulated in the theorem 48.1 is known as the 
geometric interpretation of the vector product. 
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Now let’s return to Fig. 48.1. Applying the formula (45.1), for 
the mixed product (a,b,c) we derive 


(a,b,c) = (a, [b,c]) = (a,d) = |a] |d| cos ọ. (48.2) 


Note that |a| cosy is the length of the segment [OF], which 
coincides with the length of [AH]. The segment [AH] is parallel 
to the segment [OF] and to the vector d, which is perpendicular 
to the base plane of the skew-angular parallelepiped shown in 
Fig. 48.1. Hence the segment [AH] represents the height of this 
parallelepiped ad we have the formula 


h =|AA| = |a| cosy. (48.3) 
Now from (48.1), (48.3), and (48.2) we derive 
(a,b,c) = Sh =V, (48.4) 


i.e. the mixed product (a,b,c) in our case coincides with the 
volume of the skew-angular parallelepiped built on the vectors a, 
b, and c. 

In the general case the value of the mixed product of three 
non-coplanar vectors a, b, c can be either positive or negative, 
while the volume of a parallelepiped is always positive. Therefore 
in the general case the formula (48.4) should be written as 


V, ifa, b, cis a right triple 


f vectors; 
A of vectors; (48.5) 
—V, ifa, b, cis a left triple 


of vectors. 


DEFINITION 48.1. The oriented volume of an ordered triple of 
non-coplanar vectors is a quantity which is equal to the volume of 
the parallelepiped built on these vectors in the case where these 
vectors form a right triple and which is equal to the volume of 
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this parallelepiped taken with the minus sign in the case where 
these vectors form a left triple. 

The formula (48.5) can be written as a theorem. 


THEOREM 48.2. The mixed product of any triple of non-copla- 
nar vectors coincides with their oriented volume. 


DEFINITION 48.2. If e1, e2, e3 is a basis in the space E, then 
the oriented volume of the triple of vectors e1, e2, e3 is called 
the oriented volume of this basis. 


§49. Structural constants of the mixed product. 


Let e1, e2, e3 be some basis in the space E. Let’s consider 
various mixed products composed by the vectors of this basis: 


Cijk = (e;,€5, x). (49.1) 


DEFINITION 49.1. For any basis e1, e2, e3 in the space E 
the quantities cj, given by the formula (49.1) are called the 
structural constants of the mixed product in this basis. 


The formula (49.1) is similar to the formula (29.6) for the com- 
ponents of the Gram matrix. However, the structural constants 
of the mixed product c;;, in (49.1) constitute a three index array 
which cannot be laid into a matrix. 

An important property of the structural constants c;;, is 
their complete skew symmetry or complete antisymmetry. This 
property is expressed by the following equalities: 


Cijk = —Cjik, Cijk = —Cikj; Cijk = —Ckji» (49.2) 


The relationships (49.2) mean that under the transposition of any 
two indices the quantity c;;, changes its sign. These relationships 
are easily derived from (43.5) by applying the item 1) from the 
theorem 47.1 to the right hand side of (49.1). 
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The following relationships are an immediate consequence of 
the property of complete antisymmetry of the structural con- 
stants of the mixed product c;;,: 


Citk = —Ciik; Cijj = —Cigg, Ciji = Caja, (49.3) 


They are derived by substituting j = i, k = j, and k = i into 
(49.2). From the relationships (49.3) the vanishing condition for 
the structural constants ¢;;; is derived: 


Cijk = 0, if there are coinciding values (49.4) 
of the indices 2, 7, k. 


Now assume that the values of the indices 7, j, k do not coincide. 
In this case, applying the relationships (49.2), we derive 


Cijk = C123 if the indices i,j,k take the values (49.5) 
(1, 2,3), (2,3; 1), or (3, 12y; 
Cijk = —C123 if the indices i,j,k take the values (49.6) 
(1,3, 2), (3,2, 1), or (2, 1.3): 


The next step consists in comparing the relationships (49.4), 
(49.5), and (49.6) with the formula (43.5) that determines the 
Levi-Civita symbol ¢;;,. Such a comparison yields 


Cijk = C123 Eijk- (49.7) 


Note that c123 = (e1,€2,e3). This formula follows from (49.1). 
Therefore the formula (49.7) can be written as 


Cijk = (e1, €2, €3) Eijk. (49.8) 


THEOREM 49.1. In an arbitrary basis e1, e2, e3 the structural 
constants of the mixed product are expressed by the formula (49.8) 
through the only one constant — the oriented volume of the basis. 
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§50. Calculation of the mixed product through the 
coordinates of vectors in a skew-angular basis. 


Let e1, e2, e3 be a skew-angular basis in the space E. Let’s 
recall that the term skew-angular basis in this book is used as a 
synonym of an arbitrary basis (see Definition 29.1). Let a, b, and 
c be free vectors given by their coordinates in the basis e;, e2, e3: 


at b! c! 
a= || a? |f, b= |b? ||, c=|@ l. (50.1) 
a? b? ro 


The formulas (50.1) mean that we have the expansions 


3 3 3 
a=) de; b=} Ven c= der (50.2) 
i=1 j=1 


Let’s substitute (50.2) into the mixed product (a, b,c): 


(a,b,c) = (Sc ei, ye ej, 2d er). (50.3) 


=l 


In order to transform the formula (50.3) we apply the properties 
of the mixed product 2), 5), and 7) from the theorems 47.1 and 
47.2. Due to these properties we can bring the summation signs 
over i, j, and k outside the brackets of the mixed product: 


3 3 3 
(a,b,c) = 2, D X (a'ei, b’ ej, c" ex). (50.4) 


Now we apply the properties 3), 6), 8) from the theorems 47.1 and 
47.2. Due to these properties we can bring the numeric factors 
a’, b/, cf outside the brackets of the mixed product (50.4): 


3 3 3 
(a, b,c) =a ee c* (ei, €j, €p). (50.5) 
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The quantities (e;,e;,e,) are structural constants of the mixed 
product in the basis e1, e2, e3 (see (49.1)). Therefore the formula 
(50.5) can be written as follows: 


3 3 3 
(a, b,c) =e bÍ că Cijk. (50.6) 


i=1 j=1 k=1 


Let’s substitute (49.8) into (50.6) and take into account that the 
oriented volume (e;,€2,e3) does not depend on the summation 
indices 7, j, and k. Therefore the oriented volume (e1, e2,e3) can 
be brought outside the sums as a common factor: 


3 3 3 
(a, b,c) = (e1,€2,¢3) ) 9 Dab c" eije. (50.7) 


Note that the formula (50.7) differs from the formula (46.9) 
only by the extra factor (e1, e2, e3) in its right hand side. As for 
the formula (46.9), it is brought to the form (46.8) by applying 
the properties of the Levi-Civita symbol £;jẹ only. For this reason 
the formula (50.7) can be brought to the following form: 


a a ea? 
(a,b,c) = (e1,€2,e3) |b! b b]. (50.8) 
C g E 


DEFINITION 50.1. The formulas (50.6), (50.7), and (50.8) are 
called the formulas for calculating the mixed product through the 
coordinates of vectors in a skew-angular basis. 


§51. The relation of structural constants 
of the vectorial and mixed products. 


The structural constants of the mixed product are determined 
by the formula (49.1). Let’s apply the formula (45.1) in order to 
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transform (49.1). As a result we get the formula 
Cijk = (e;, [e;, ex]). (51.1) 


Now we can apply the formula (40.2). Let’s write it as follows: 
3 
[ej,ex] = X Ch, eg. (51.2) 
q=1 


Substituting (51.2) into (51.1) and taking into account the prop- 
erties 5) and 6) from the theorem 28.2, we derive 


3 
cyk = J Ol Ce) (51.3) 
q=1 


Let’s apply the formulas (29.6) and (30.1) to (51.3) and bring the 
formula (51.3) to the form 


3 
Cijk = >> Ch, Jqi- (51.4) 
q=1 


The following formula is somewhat more beautiful: 
3 
Gig = CE oi: (51.5) 
q=1 


In order to derive the formula (51.5) we apply the identity 
Cijk = Cjri to the left hand side of the formula (51.4). This 
identity is derived from (49.2). Then we perform the cyclic 
redesignation of indices i > k > j > i. 

The formula (51.5) is the first formula relating the structural 
constants of the vectorial and mixed products. It is important 
from the theoretical point of view, but this formula is of little 
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use practically. Indeed, it expresses the structural constants of 
the mixed product through the structural constants of the vector 
product. But for the structural constants of the mixed product 
we already have the formula (49.8) which is rather efficient. As 
for the structural constants of the vector product, we have no 
formula yet, except the initial definition (40.2). For this reason 
we need to invert the formula (51.5) and express Ci. through Cijk- 
In order to reach this goal we need some auxiliary information 
on the Gram matrix. 


THEOREM 51.1. The Gram matrix G of any basis in the space 
E is non-degenerate, i.e. its determinant is nonzero: det G # 0. 


THEOREM 51.2. For any basis e1, e2, e3 in the space E the 
determinant of the Gram matrix G is equal to the square of the 
oriented volume of this basis: 


det G = (e1, 2, e3)”. (51.6) 


The theorem 51.1 follows from the theorem 51.2. Indeed, a ba- 
sis is a triple of non-coplanar vectors. From the non-coplanarity 
of the vectors e1, €2, es; due to item 4) of the theorem 47.1 we 
get (e1, €2,€3) Æ 0. Then the formula (51.6) yields 


det G > 0, (51.7) 


while the theorem 51.1 follows from the inequality (51.7). 

I will not prove the theorem 51.2 right now at this place. This 
theorem is proved below in § 56. 

Let’s proceed to deriving consequences from the theorem 51.1. 
It is known that each non-degenerate matrix has an inverse 
matrix (see [7]). Let’s denote through G7! the matrix inverse to 
the Gram matrix G. In writing the components of the matrix 
G~! the following convention is used. 


DEFINITION 51.1. For denoting the components of the matrix 
G-t inverse to the Gram matrix G the same symbol g as for the 
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components of the matrix G itself is used, but the components of 
the inverse Gram matrix are enumerated with two upper indices: 


g! g! g!3 
Gu} = g?! g”? g” (51.8) 
go ar g? 


The matrices G and G~! are inverse to each other. Their 
product in any order is equal to the unit matrix: 


GG =1, G!.G=1. (51.9) 

From the regular course of algebra we know that each of the 

equalities (51.9) fixes the matrix G~! uniquely once the matrix 

G is given (see. [7]). Now we apply the matrix transposition 
operation to both sides of the matrix equalities (51.9): 

(Gag ty =i" =1. (51.10) 

Then we use the identity (A-B)' = B'-A" from the exercise 29.2 

in order to transform the formula (51.10) and take into account 

the symmetry of the matrix G (see Theorem 30.1): 
(GTQ =(G"y «G1, (51.11) 


The rest is to compare the equality (51.11) with the second 
matrix equality (51.9). This yields 


CG =g: (51.12) 


The formula (51.12) can be written as a theorem. 


THEOREM 51.3. For any basis e1, e2, e3 in the space E the 
matrix G~' inverse to the Gram matrix of this basis is symmetric. 
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In terms of the matrix components of the matrix (51.7) the 
equality (51.12) is written as an equality similar to (30.1): 


g = gt. (51.13) 


The second equality (51.9) is written as the following relation- 
ships for the components of the matrices (51.8) and (29.7): 


3 
ťa ong = (51.14) 
k=1 


Here 6° is the Kronecker symbol defined by the formula (23.3). 
Using the symmetry identities (51.13) and (30.1), we write the 
relationship (51.14) in the following form: 


3 
yong” =o). (51.15) 
k=1 


Now let’s multiply both ides of the equality (51.5) by g** and 
then perform summation over k in both sides of this equality: 


3 3 3 
dock = DD Ch Iag". (51.16) 
k=1 k=1q=1 


If we take into account the identity (51.15), then we can bring 
the formula (51.16) to the following form: 


3 3 
ee ON (51.17) 
k=1 q=1 


When summing over q in the right hand side of the equality 
(51.17) the index q runs over three values 1, 2,3. Then the 
Kronecker symbol 6; takes the values 0 and 1, the value 1 is 
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taken only once when q = s. This means that only one of three 
summands in the right hand side of (51.17) is nonzero. This 
nonzero summand is equal to Cf. Hence the formula (51.17) can 
be written in the following form: 


3 


Yang =. (51.18) 
k=1 


Let’s change the symbol k for q and the symbol s for k in (51.18). 
Then we transpose left and right hand sides of this formula: 


3 
C= > Giga”. (51.19) 
q=1 


The formula (51.19) is the second formula relating the struc- 
tural constants of the vectorial and mixed products. On the base 
of the relationships (51.5) and (51.19) we formulate a theorem. 


THEOREM 51.4. For any basis e1, e2, e3 in the space E the 
structural constants of the vectorial and mixed products in this 
basis are related to each other in a one-to-one manner by means 
of the formulas (51.5) and (51.19). 


§52. Effectivization of the formulas for 
calculating vectorial and mixed products. 


Let’s consider the formula (29.8) for calculating the scalar 
product in a skew-angular basis. Apart from the coordinates of 
vectors, this formula uses the components of the Gram matrix 
(29.7). In order to get the components of this matrix one 
should calculate the mutual scalar products of the basis vectors 
(see formula (29.6)), for this purpose one should measure their 
lengths and the angles between them (see Definition 26.1). No 
other geometric constructions are required. For this reason the 
formula (29.8) is recognized to be effective. 
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Now let’s consider the formula (41.5) for calculating the vector 
product in a skew-angular basis. This formula uses the structural 
constants of the vector product which are defined by means of 
the formula (40.2). According to this formula, in order to cal- 
culate the structural constants one should calculate the vector 
products [e;,e;] in its left hand side. For this purpose one should 
construct the normal vectors (perpendiculars) to the planes given 
by various pairs of basis vectors e;, e;. (see Definition 35.1). 
Upon calculating the vector products [e;,e;] one should expand 
them in the basis e1, e2, e3, which require some auxiliary geo- 
metric constructions (see formula (18.4) and Fig. 18.1). For this 
reason the efficiency of the formula (41.5) is much less than the 
efficiency of the formula (29.8). 

And finally we consider the formula (50.7) for calculating the 
mixed product in a skew-angular basis. In order to apply this 
formula one should know the value of the mixed product of the 
three basis vectors (e1, €2,€3). It is called the oriented volume of 
a basis (see Definition 48.2). Due to the theorem 48.2 and the 
definition 48.1 for this purpose one should calculate the volume 
of the skew-angular parallelepiped built on the basis vectors 
€1, €2, e3. In order to calculate the volume of this parallelepiped 
one should know its base area and its height. The area of its 
base is effectively calculated by the lengths of two basis vectors 
and the angle between them (see formula (48.1)). As for the 
height of the parallelepiped, in order to find it one should drop a 
perpendicular from one of its vertices to its base plane. Since we 
need such an auxiliary geometric construction, the formula (50.7) 
is less effective as compared to the formula (29.8) in the case of 
the scalar product. 

In order to make the formulas (41.5) and (50.7) effective we 
use the formula (51.6). It leads to the following relationship: 


(e1, €2, €3) = +vdetG. (52.1) 


The sign in (52.1) is determined by the orientation of a basis: 


§52. EFFECTIVIZATION OF THE FORMULAS ... 123 


VdetG if the basis e1, e2, e3 


is right; 
(e1, €2,€3) = (52.2) 
—VdetG if the basis e1, e2, e3 
is left. 


Let’s substitute the expression (52.1) into the formula for the 
mixed product (50.7). As a result we get 


(a,b,c) = +vdetG 


i=1 j 


3 
Se c ee (52.3) 


1 k=1 


Similarly, substituting (52.1) into the formula (50.8), we get 


at a? a? 
(a,b,c) =4+VdetG |b b |. (52.4) 
c! C2 c 


DEFINITION 52.1. The formulas (52.3) and (52.4) are called 
the effectivized formulas for calculating the mixed product throu- 
gh the coordinates of vectors in a skew-angular basis. 


In the case of the formula (41.5), in order to make it effec- 
tive we need the formulas (49.8) and (51.19). Substituting the 
expression (52.1) into these formulas, we obtain 


Cijk = +V det G Eijks (52.5) 


3 
Ch =4VdetG Y eizg 9. (52.6) 
q=1 


DEFINITION 52.2. The formulas (52.5) and (52.6) are called 
the effectivized formulas for calculating the structural constants 
of the mixed and vectorial products. 


Now let’s substitute the formula (52.6) into (41.5). This leads 
to the following formula for the vector product: 
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3 3 3 3 
la, b] = +v det G > `> y >. a’ bÍ Sija g ex. (52.7) 


DEFINITION 52.3. The formula (52.7) is called the effectivized 
formula for calculating the vector product through the coordi- 
nates of vectors in a skew-angular basis. 


§ 53. Orientation of the space. 


Let’s consider the effectivized formulas (52.1), (52.3), (52.4), 
(52.5), (52.6), and (52.7) from §52. Almost all information on 
a basis in these formulas is given by the Gram matrix G. The 
Gram matrix arising along with the choice of a basis reflects an 
important property of the space E — its metric. 


DEFINITION 53.1. The metric of the space E is its structure 
(its feature) that consists in possibility to measure the lengths of 
segments and the numeric values of angles in it. 


The only non-efficiency remaining in the effectivized formulas 
(52.1), (52.3), (52.4), (52.5), (52.6), and (52.7) is the choice of 
sign in them. As was said above, the sign in these formulas 
is determined by the orientation of a basis (see formula (52.2)). 
There is absolutely no possibility to determine the orientation of a 
basis through the information comprised in its Gram matrix. The 
matter is that the mathematical space E described by Euclid’s 
axioms (see [6]), comprises the possibility to distinguish a pair 
of bases with different orientations from a pair of bases with 
coinciding orientations. However, it does not contain any reasons 
for to prefer bases with one of two possible orientations. 

The concept of right triple of vectors (see Definition 34.2) and 
the possibility to distinguish right triples of vectors from left ones 
is due to the presence of people, it is due to their ability to 
observe vectors and compare their rotation with the rotation of 
clock hands. Is there a fundamental asymmetry between left and 
right not depending on the presence of people and on other non- 
fundamental circumstances? Is the space fundamentally oriented? 
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This is a question on the nature of the physical space E. Some 
research in the field of elementary particle physics says that such 
an asymmetry does exist. As for me, as the author of this book I 
cannot definitely assert that this problem is finally resolved. 


§54. Contraction formulas. 


Contraction formulas is a collection of four purely algebraic 
identities relating the Levi-Civita symbol and the Kronecker 
symbol with each other. They are proved successively one after 
another. Once the first identity is proved, each next identity is 
derived from the previous one. 


THEOREM 54.1. The Levi-Civita symbol and the Kronecker 
symbol are related by the first contraction formula 


HE oe 
en eu = oy oF on $ (54.1) 
a O 


PROOF. Let’s denote through fip" the right hand side of 
the first contraction formula (54.1). The transposition of any 
two lower indices in fik P is equivalent to the corresponding 
transposition of two columns in the matrix (54.1). It is known 
that the transposition of two columns of a matrix results in 
changing the sign of its determinant. This yield the following 
relationships for the quantities fik P, 


poe a see fR = ie fin Ss kje (54.2) 


ijk 


ijk 


The relationships (54.2) are analogous to the relationships (49.2). 
Repeating the considerations used in §49 when deriving the 
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formulas (49.4), (49.5), (49.6), from the formula (54.2) we derive 


0 if there are coinciding values 
of the indices i, j, k; 


193. if the values of the indices i, j, k 
mnp — form an even permutation of 4 
Sigh the numbers 1, 2, 3; oe 
—fi53°, if the values of the indices i, j, k 


form an odd permutation of the 
numbers 1, 2,3. 


Let’s compare the formula (54.3) with the formula (43.5) defining 
the Levi-Civita symbol. Such a comparison yields 


Like the initial quantities fip, the factor figg” in (54.4) is 
defined as the determinant of a matrix: 


dy” oy og 
ig Sor Oe os (54.5) 
OF -0,. 05 
Transposition of any pair of the upper indices in f/53” is equiv- 
alent to the corresponding transposition of rows in the matrix 
(54.5). Again we know that the transposition of any two rows of 
a matrix changes the sign of its determinant. Hence we derive 
the following relationships for the quantities (54.5): 


mnp _ nmp mnp mpn mnp 


_ pnm 
1233 ~ J123 >? 123 ~ J123 > 123 ~ J123 (54.6) 


The relationships (54.6) are analogous to the relationships (54.2), 
which are analogous to (49.2). Repeating the considerations used 
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in §49, from the relationships (54.6) we immediately derive the 
formula analogous to the formula (54.3): 


0 if there are coinciding values of 
the indices m,n, p; 


i23 if the values of the indices m, n, p 
mnp _ form an even permutation of the (54.7) 
numbers 1, 2, 3; l 


—fiz3 if the values of the indices m, n, p 
form an odd permutation of the 
numbers 1, 2,3. 


Let’s compare the formula (54.7) with the formula (43.5) defining 
the Levi-Civita symbol. This comparison yields 


mnp _ £123 -mnp 
123 — J123 € : (54.8) 


Let’s combine the formulas (54.4) and (54.8), i.e. we substitute 
(54.8) into (54.4). This substitution leads to the formula 
fie = Jiz E” Eijk- (54.9) 


Now we need to calculate the coefficient f/33 in the formula 


(54.9). Let’s recall that the quantity fip was defined as the 
right hand side of the formula (54.1). Therefore we can write 


a amar 


fin’ =| OP OF OR (54.10) 


P P 


and substitute i = m = 1, j = n = 2, k = p = 3 into (54.10): 


EOE ET 
= |6? 63 aja 
0 


0 
0} =1. (54.11) 
E B 83 1 


O o t © 
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Taking into account the value of the coefficient fi} given by 
the formula (54.11), we can transform the formula (54.9) to 

FELO = E Gijp, (54.12) 

Now the required contraction formula (54.1) is obtained as a con- 

sequence of (54.10) and (54.12). The theorem 54.1 is proved. 


THEOREM 54.2. The Levi-Civita symbol and the Kronecker 
symbol are related by the second contraction formula 


ae 6m 5m 
Soe eae ro (54.13) 
k=1 ò; ò; 


PROOF. The second contraction formula (54.13) is derived 
from the first contraction formula (54.1). For this purpose we 
substitute p = k into the formula (54.1) and take into account 
that dk = 1. This yields the formula 


om aro ap 
OF ô 1 


(3 


Let’s insert the summation over k to both sides of the formula 
(54.14). As a result we get the formula 


, p Jr om or 
k=1 k=1 ôk oF 1 


The left hand side of the obtained formula (54.15) coincides 
with the left hand side of the second contraction formula (54.13) 
to be proved. For this reason below we transform the right hand 
side of the formula (54.15) only. Let’s expand the determinant in 
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the formula (54.15) by its last row: 


3 3 om gm 
gs. = a ae 
Perens BUST an ap 
7 =- pm (54.16) 
ge [OO 6m ot ) 
7 | oP Op op on 


The summation over k in the right hand side of the formula 
(54.16) applies to all of the three terms enclosed in the round 
brackets. Expanding these brackets, we get 


3 3 6m 6m 
ee. 
k=1 : k=1 J ; i , (54.17) 
Am om oye 
k | ot k i j 
k=1 i k k=1| `t J 


The first sum in the right hand side of (54.17) contains the 
factor ôF. In performing the summation cycle over k this factor 
appears to be nonzero only once when k = i. For this reason 
only one summand of the first sum does actually survive. In this 
term k = i. Similarly, in the second sum also only one its term 
does actually survive, in this term k = j. As for the last sum in 
the right hand side of (54.17), the expression being summed does 
not depend on k. Therefore it triples upon calculating this sum. 
Taking into account that 6? = oH = 1, we get 


3 


mnk 
> E Eijk = 


k=1 


om gm 
J 


nE 


o a 


ge ar 


E 


The first determinant in the right hand side of the above formula 
differs from two others by the transposition of its columns. If 
we perform this transposition once more, it changes its sign and 
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we get a formula with three coinciding determinants. Then, 
collecting the similar terms, we derive 


Soe?" eip = (—1- 1+3) on ae (54.18) 
k=1 i J 


Now it is easy to see that the formula (54.18) leads to the 
required formula (54.13). The theorem 54.2 is proved. 


THEOREM 54.3. The Levi-Civita symbol and the Kronecker 
symbol are related by the third contraction formula 


3 3 
ťa oo (54.19) 


PROOF. The third contraction formula (54.19) is derived from 
the second contraction formula (54.13). For this purpose let’s 
substitute n = j into the formula (54.13), take into account that 
6! = 1, and insert the summation over j into both sides of the 
obtained equality. This yields the formula 


3 3 l 3 |m 5m 3 
yee = k i =X (0r — 5% 5) 
j=1 k=1 j=1 i j=1 


Upon expanding the brackets the sum over j in the right hand 
side of the above formula can be calculated explicitly: 


3 


3-8 3 
Soy et ae DED =30 n o =D, 


j=1 k=1 j=1 j=1 


It is easy to see that the calculations performed prove the for- 
mula (54.19) and the theorem 54.3 in whole. 
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THEOREM 54.4. The Levi-Civita symbol and the Kronecker 
symbol are related by the fourth contraction formula 


3 3 3 
SoS Sc eign = 6. (54.20) 


i=1 j=l k=1 


PROOF. The fourth contraction formula (54.20) is derived 
from the third contraction formula (54.19). For this purpose we 
substitute m = į into (54.19) and insert the summation over i 
into both sides of the obtained equality. This yields 


3 3 3 3 i 
D D > eiA i=) 1 = 2-3 = 6. 
i=1 j=1 k=1 j=l a 


The above calculations prove the formula (54.20), which com- 
pletes the proof of the theorem 54.4. 


§55. The triple product expansion 
formula and the Jacobi identity. 


THEOREM 55.1. For any triple of free vectors a, b, and c in 
the space E the following identity is fulfilled: 


a, [b, c]] = b (a,c) — c (a, b), (55.1) 


which is known as the triple product expansion formula’. 


PROOF. In order to prove the identity (55.1) we choose some 
right orthonormal basis e1, e2, e3 in the space E and let’s expand 
the vectors a, b, and c in this basis: 


3 3 3 
a=) dep b=) be; es) c ep (55.2) 
i=1 j=1 k=1 


1 In Russian literature the triple product expansion formula is known as 
the double vectorial product formula or the «BAC minus CAB> formula. 
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Let’s denote d = [b,c] and use the formula (44.1) for to calculate 
the vector d. We write this formula as 


3 


d= 3 ( Ss 3 bci sie) ek. (55.3) 


k=1 \i=1 j=l 


The formula (55.3) is the expansion of the vector d in the basis 
€1, €2, e3. Therefore we can get its coordinates: 


3. 3 
d = » >. bi cI Eijk. (55.4) 
i=1 j=1 


Now we again apply the formula (44.1) in order to calculate 
the vector |[a, [b,c]] = [a, d]. In this case we write it as follows: 


3 3 3 
ad= Sy 2" d tu (55.5) 


n=1 m=1 k=1 


Substituting (55.4) into the formula (55.5), we get 


sy ~ a et 


mpi 
a™ b'c Eijk Emkn en = 


(55.6) 


= 5 3 ye b’ ci (F curenen) en. 


k=1 


The upper or lower position of indices in the Levi-Civita symbol 
does not matter (see (43.5)). Therefore, taking into account 
(43.8), we can write Emgn = EF” = —e™”k and bring the 
formula (55.6) to the following form: 
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The sum enclosed in the round brackets in (55.7) coincides with 
the left hand side of the second contraction formula (54.13). 
Applying (54.13), we continue transforming the formula (55.7): 


3 3 3 m m 
adl=-S> or yrarvcr | J en 
n=1m=1 i=1 j=1 oj ð; 


3 3 
a5. Dy Soa bt cl (62 5" — 5 57) ep, = 


3 3 3 
-— Dee dD ab d 5 FF en = 


The result obtained can be written as follows: 


amel- (Sooo) (Era) - (He) (Hee) 


Now, let’s recall that the scalar product in an orthonormal 
basis is calculated by the formula (33.3). The formula (33.3) 
is easily recognized within the above relationship. Taking into 
account this formula, we get the equality 


la, [b, c]] = (a,c) > b'e; — (a,b) X c ej. (55.8) 


In order to bring (55.8) to the ultimate form (55.1) it is sufficient 
to find the expansions of the form (55.2) for b and c within the 
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above formula (55.8). As a result the formula (55.8) takes the 
form [a,[b,c]] = (a,c) b — (a,b)c, which in essential coincides 
with (55.1). The theorem 55.1 is proved. 


THEOREM 55.2. For any triple of free vectors a, b, and c in 
the space E the Jacobi identity is fulfilled: 


la, [b, c]] + [b, [e, a]] + [c, [a, b]] = 0. (55.9) 


PROOF. The Jacoby identity (55.9) is easily derived with the 
use of the triple product expansion formula (55.1): 


[a, [b, c]] = b (a,c) — c (a, b). (55.10) 


Let’s twice perform the cyclic redesignation of vectors a > b > 
c — a in the above formula (55.10). This yields 


c(b,a) — a(b,c), (55.11) 
[c, [a, b]] = a (c, b) — b (c, a). (55.12) 


By adding the above three equalities (55.10), (55.11), and (55.12) 
in the left hand side we get the required expression [a, [b,c]] + 
[b, [c, a]] + [c, [a,b]], while th right hand side of the resulting 
equality vanishes. The theorem 55.2 is proved. 


§56. The product of two mixed products. 


THEOREM 56.1. For any six free vectors a, b, c, x, y, and Z 
in the space E the following formula for the product of two mixed 
products (a,b,c) and (x,y,z) is fulfilled: 


(a,b,c) (x,y,z) =| (aiy) (by) (c,y)|. (56.1) 
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In order to prove the formula (56.1) we need two properties 
of the matrix determinants which are known as the linearity with 
respect to a row and the linearity with respect to a column (see 
[7]). The first of them can be expressed by the formula 


1 1 
Ly re Te 


Lo PO a ONS Laat os Laan 


LEMMA 56.1. If the k-th row of a square matrix is a linear 
combination of some r rows (non necessarily coinciding with the 
other rows of this matrix), then the determinant of such a matrix 
is equal to a linear combination of r separate determinants. 


The linearity with respect to a column is formulated similarly. 


LEMMA 56.2. If the k-th column of a square matrix is a linear 
combination of some r columns (non necessarily coinciding with 
the other columns of this matrix), then the determinant of such a 
matrix is equal to a linear combination of r separate determinants. 


I do not write the formula illustrating the lemma 56.2 since it 
does not fit the width of a page in this book. This formula can 
be obtained from the above formula by transposing the matrices 
in both its sides. 


PROOF OF THE THEOREM 56.1. Let e1, e2, es be some right 
orthonormal basis in the space E. Assume that the vectors a, b, 
c, x, y, and z are given by their coordinates in this basis. Let’s 
denote through L the left hand side of the formula (56.1): 


L = (a,b,c) (x,y, Z). (56.2) 
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In order to calculate L we apply the formula (46.9). In the case 
of the vectors a, b, c the formula (46.9) yields 


3 3 3 
(a, b,c) =} y y a bÍ că eij. (56.3) 
i=1 j=1 k=1 


In the case of the vectors x, y, z the formula (46.9) is written as 


3 3 3 
(x,y,z) = y tp y geye gh ger, (56.4) 


Note that raising indices of the Levi-Civita symbol in (56.4) does 
not change its values (see (43.5)). Now, multiplying the formulas 
(56.3) and (56.4), we obtain the formula for L: 


3 3 


3 3 
L=\ N` 3. D LS oe bÍ cë r” y” 2P e™™P cs. (56.5) 


The product e™”? €;;, in (56.5) can be replaced by the matrix 
determinant taken from the first contraction formula (54.1): 


3 ar 5m ap 
L= o bi cka™ yz? | OP OP OR |. (56.6) 
cae pe PF OF OR 


The next step consists in applying the lemmas 56.1 and 56.2 in 
order to transform the formula (56.6). Applying the lemma 56.2, 
we bring the sum over i and the associated factor a’ into the 
first column of the determinant. Similarly, we bring the sum 
over j and the factor bÍ into the second column, and finally, we 
bring the sum over k and its associated factor c* into the third 
column of the determinant. Then we apply the lemma 56.1 in 
order to distribute the sums over m, n, and p to the rows of the 
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determinant. Simultaneously, we distribute the associated factors 
z™, y”, and z? to the rows of the determinant. As a result of our 
efforts we get the following formula: 


3 3 3 3 3 
at a Soe eo ea as 
i=1 m=1 


j=l m=1 k=1m=1 


s 3 3 3 
Sey Deby sp SE Sek yop 
n=1 j=l n=1 


3 
1 
3 
=1 k=1 n=1 
3 


>, 

3 3 3 3 3 
D244 i aP oP DP ao se eo 
i=1 p=1 [1p 


Due to the Kronecker symbols the double sums in the above 
formula are reduced to single sums: 


3 3 
Soa’ x ye x yo r} 
i=l k=1 

3 3 3 
L= Xa y’ D DD yl. (56.7) 
i=1 j=1 


k=1 
3 3 
X aizi D yor 
i=1 j=1 k=1 


Let’s recall that our basis e1, e2, e3 is orthonormal. The scalar 
product in such a basis is calculated according to the formula 
(33.3). Comparing (33.3) with (56.7), we see that all of the sums 
within the determinant (56.7) are scalar products of vectors: 


L=\(ay) (b,y) (c,y)]. (56.8) 


138 CHAPTER I. VECTOR ALGEBRA. 


Now the formula (56.1) follows from the formulas (56.2) and 
(56.8). The theorem 56.1 is proved. 


The formula (56.1) is valuable for us not by itself, but due to 
its consequences. Assume that e1, e2, e3 is some arbitrary basis. 
Let’s substitute a = e1, b = e2, c = €3, X = €1, Y= €2, Z = 3 
into the formula (56.1). As a result we obtain 


(e1,e1) (e2,e1) (e3,e1) 
(e1, €2,e3)” = (e1, e2) (e2, e2) (e3, €2) 5 (56.9) 
(e1,e3) (e2,e3) (e3, e3) 
Comparing (56.9) with (29.7) and (29.6), we see that the 
matrix (56.9) differs from the Gram matrix G by transposing. 
If we take into account the symmetry of the Gram matrix (see 


theorem 30.1), than we find that they do coincide. Therefore the 
formula (56.9) is written as follows: 


(e1, €2,e3)” = det G. (56.10) 


The formula (56.10) coincides with the formula (51.6). This fact 
proves the theorem 51.2, which was unproved in § 51. 


CHAPTER II 


GEOMETRY OF LINES AND SURFACES. 


In this Chapter the tools of the vector algebra developed in 
Chapter I are applied for describing separate points of the space 
E and for describing some geometric forms which are composed 
by these points. 


§1. Cartesian coordinate systems. 


DEFINITION 1.1. A Cartesian coordinate system in the space 

E is a basis e1, €2, e3 complemented by some fixed point O of 
this space. The point O being a part of the Cartesian coordinate 
X system O, e1, €2, e3 is called the 

origin of this coordinate system. 


DEFINITION 1.2. The vector 
OX binding the origin O of a 
Cartesian coordinate system O, 
€1, €2, e3 with a point X of the 
space E is called the radius vec- 
tor of the point X in this coor- 
dinate system. 


Fig. 1.1 


The free vectors e1, €2, e3 be- 
ing constituent parts of a Carte- 
sian coordinate system O, e1, e2, e3 remain free. However they 
are often represented by geometric realizations attached to the 
origin O (see Fig. 1.1). These geometric realizations are extended 
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up to the whole lines which are called the coordinate axes of the 
Cartesian coordinate system O, e1, €2, €3. 


DEFINITION 1.3. The coordinates of a point X in a Cartesian 
coordinate system O, e1, e2, e3 are the coordinates of its radius 
vector ry = OX in the basis ej, e2, e3. 


Like other vectors, radius vectors of points are covered by the 
index setting convention (see Definition 20.1 in Chapter I). The 
coordinates of radius vectors are enumerated by upper indices, 
they are usually arranged into columns: 


a alee | (1.1) 


However, in those cases where a point X is represented by 
its coordinates these coordinates are arranged into a comma- 
separated row and placed within round brackets just after the 
symbol denoting the point X itself: 


X = X(x, 2°, x£’). (1.2) 


The upper position of indices in the formula (1.2) is inherited 
from the formula (1.1). 

Cartesian coordinate systems can be either rectangular or 
skew-angular depending on the type of basis used for defining 
them. In this book I follow a convention similar to that of the 
definition 29.1 in Chapter I. 


DEFINITION 1.4. In this book a skew-angular coordinate sys- 
tem is understood as an arbitrary coordinate system where no 
restrictions for the basis are imposed. 


DEFINITION 1.5. A rectangular coordinate system O, e1, eo, 
e3 whose basis is orthonormal is called a rectangular coordinate 
system with unit scales along the axes. 
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A remark. The radius vector of a point X written as Ox’ 
is a geometric vector whose position in the space is fixed. The 
radius vector of a point X written as ry is a free vector We 
can perform various operations of vector algebra with this vector: 
we can add it with other free vectors, multiply it by numbers, 
compose scalar products etc. But the vector rx has a special 
mission — to be a pointer to the point X. It can do this mission 
only if its initial point is placed to the origin. 


EXERCISE 1.1. Relying on the definition 1.1, formulate anal- 
ogous definitions for Cartesian coordinate systems on a line and 
on a plane. 


§ 2. Equations of lines and surfaces. 


Coordinates of a single fixed point X in the space E are 
three fixed numbers (three constants). If these coordinates are 
changing, we get a moving point that runs over some set within 
the space E. In this book we consider the cases where this set 
is some line or some surface. The case of a line differs from the 
case of a surface by its dimension or, in other words, by the 
number of degrees of freedom. Each surface is two-dimensional — 
a point on a surface has two degrees of freedom. Each line is 
one-dimensional — a point on a line has one degree of freedom. 

Lines and surfaces contain infinitely many points. Therefore 
they cannot be described by enumeration where each point is 
described separately. Lines and surfaces are described by means 
of equations. The equations of lines and surfaces are subdivided 
into several types. If the radius vector of a point enters an equa- 
tion as a whole without subdividing it into separate coordinates, 
such an equation is called a vectorial equation. If the radius 
vector of a point enters an equation through its coordinates, such 
an equation is called a coordinate equation. 

Another attribute of the equations is the method of imple- 
menting the degrees of freedom. One or two degrees of freedom 
can be implemented in an equation explicitly when the radius 


142 CHAPTER II. GEOMETRY OF LINES AND SURFACES. 


vector of a point is given as a function of one or two variables, 
which are called parameters. In this case the equation is called 
parametric. Non-parametric equations behave as obstructions de- 
creasing the number of degrees of freedom from the initial three 
to one or two. 


§3. A straight line on a plane. 


Assume that some plane q in the space E is chosen and fixed. 
Then the number of degrees of freedom of a point immediately 
decreases from three to two. In 
y order to study various forms of 
equations defining a straight line 
on the plane œ we choose some 
coordinate system O, e1, €2 on 
this plane. Then we can define 
the points of the plane a and the 
points of a line on it by means 
of their radius-vectors. 
1. Vectorial parametric 
equation of a line on a plane. 
Fig. 3.1 Let’s consider a line on a plane 
with some coordinate system O, 
€1, €2. Let X be some arbitrary point on this line (see Fig. 3.1) 
and let A be some fixed point of this line. The position of the 


point X relative to the point A is marked by the vector AX , 

while the position of the point A itself is determined by its radius 
ies 

vector ro = OA. Therefore we have 


r=OX =ro+4X. (3.1) 


Let’s choose and fix some nonzero vector a Æ 0 directed along 
the line in question. The vector AX is expressed through a by 
means of the following formula: 


AX =a-t. (3.2) 
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From the formulas (3.1) and (3.2) we immediately derive: 


r=rot+a-t. 3.3 
( 


DEFINITION 3.1. The equality (3.3) is called the vectorial 
parametric equation of a line on a plane. The constant vector 
a Æ 0 in it is a directional vector of the line, while the variable 
t is a parameter. The constant vector rp in (3.3) is the radius 
vector of an initial point. 


Each particular value of the parameter t corresponds to some 
definite point on the line. The initial point A with the radius 
vector rp is associated with the value t = 0. 

2. Coordinate parametric equations of a line on a plane. 
Let’s determine the vectors r, rg, and a in the vectorial paramet- 
ric equation (3.3) through their coordinates: 


a x a 
r= : ro =| ° |), a=|| 7 (3.4) 
y Yo ay 
Due to (3.4) the equation (3.3) is written as two equations: 
£ = To + azrt 
{ 0 T “s (3.5) 
Y = Yo + Qy t. 


DEFINITION 3.2. The equalities (3.5) are called the coordinate 
parametric equations of a straight line on a plane. The constants 
a, and ay in them cannot vanish simultaneously. 


3. Normal vectorial equation of a line on a plane. Let 
n Æ 0 be a vector lying on the plane a in question and being 
perpendicular to the line in question (see Fig. 3.1). Let’s apply 
the scalar multiplication by the vector n to both sides of the 
equation (3.3). As a result we get 


(r,n) = (ro, n) + (a,n) t. (3.6) 
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But a L n. Fort this reason the second term in the right hand 
side of (3.6) vanishes and the equation actually does not have the 
parameter t. The resulting equation is usually written as 


(r —ro,n) = 0. (3.7) 


The scalar product of two constant vectors rọ and n is a numeric 
constant. If we denote D = (ro, n), then the equation (3.7) can 
be written in the following form: 


(r,n)= D. (3.8) 


DEFINITION 3.3. Any one of the two equations (3.7) and (3.8) 
is called the normal vectorial equation of a line on a plane. The 
constant vector n Æ 0 in these equations is called a normal vector 
of this line. 


4. Canonical equation of a line on a plane. Let’s consider 
the case where a, # 0 and a, Æ 0 in the equations (3.5). In this 
case the parameter t can be expressed through x and y: 


2. p= (3.9) 
üz ay 
From the equations (3.9) we derive the following equality: 
Soo o SE (3.10) 
Ay dy 
If a, = 0, then the the first of the equations (3.5) turns to 
£ = T0. (3.11) 


If ay = 0, then the second equation (3.5) turns to 


Y = Yo. (3.12) 
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Like the equation (3.10), the equations (3.11) and (3.12) do not 
comprise the parameter t. 


DEFINITION 3.4. Any one of the three equalities (3.10), (3.11), 
and (3.12) is called the canonical equation of a line on a plane. 
The constants a, and a, in the equation (3.10) should be nonzero. 


5. The equation of a line passing through two given 
points on a plane. Assume that two distinct points A # B on a 
plane are given. We write their coordinates 


A = A(Zo, Yo); B= B(#1,y1). (3.13) 


—> 
The vector a = AB can be used for the directional vector of the 
line passing through the points A and B in (3.13). Then from 
(3.13) we derive the coordinates of a: 


Qr Tı — To 


a= = : (3.14) 
ay Yı — Yo 
Due to (3.14) the equation (3.10) can be written as 
Z- _ Yy- (325) 


Tı — Xo Yı — Yo 


The equation (3.15) corresponds to the case where zı # zo and 
yi Æ yo. If x; = xo, then we write the equation (3.11): 


f= ip =i: (3.16) 
If yı = yo, we write the equation (3.12): 
Y = Yo = y1. (3.17) 


The conditions zı = xp and yı = yo cannot be fulfilled simulta- 
neously since the points A and B are distinct, i.e. A Æ B. 
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DEFINITION 3.5. Any one of the three equalities (3.15), (3.16), 
and (3.17) is called the equation of a line passing through two 
given points (3.13) on a plane. 


6. General equation of a line on a plane. Let’s apply the 
formula (29.8) from Chapter I in order to calculate the scalar 
product in (3.8). In this particular case it is written as 


(r,n) = Dope nÍ gij, (3.18) 


where gij are the components of the Gram matrix for the basis 
€1, €2 on our plane (see Fig. 3.1). In Fig. 3.1 the basis e1, e2 is 
drawn to be rectangular. However, in general case it could be 
skew-angular as well. Let’s introduce the notations 


2 
Ni = 5 nd Jij- (3.19) 
j=1 


The quantities nt? and n? in (3.18) and in (3.19) are the coordi- 
nates of the normal vector n (see Fig. 3.1). 


DEFINITION 3.6. The quantities nı and no produced from the 
coordinates of the normal vector n by mens of the formula (3.19) 
are called the covariant coordinates of the vector n. 


Taking into account the notations (3.19), the formula (3.18) is 
written in the following simplified form: 


(aj= J rne (3.20) 


i=1 


Let’s recall the previous notations (3.4) and introduce new ones: 


A= ni, B= nə. (3.21) 
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Due to (3.4), (3.20), and (3.21) the equation (3.8) is written as 
Az+By-—-D=0. (3.22) 


DEFINITION 3.7. The equation (3.22) is called the general 
equation of a line on a plane. 


7. Double intersect equation of a line on a plane. Let’s 
consider a line on a plane that does not pass through the ori- 
gin and intersects with both of 


y the coordinate axes. These con- 
y ditions mean that D 4 0, A £ 0, 
b and B ¥ 0 in the equation (3.22) 


of this line. Through X and Y 
in Fig. 3.2 two intercept points 


e2 X are denoted: 
O 1 a z X= X (a, 0), 
(3.23) 
Y = Y (0,b). 
Fig. 3.2 


The quantities a and b in (3.23) 
are expressed through the constant parameters A, B, and D of 
the equation (3.22) by means of the following formulas: 


a= DJA, b= DJB. (3.24) 


The equation (3.22) itself in our present case can be written as 


T y 


—— +- =l 3.25 
D/A* D/B toa) 
If we take into account (3.24), the equality (3.25) turns to 
t y 
-+5 =l. 3.26 
ata (3-26) 
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DEFINITION 3.8. The equality (3.26) is called the double in- 
tercept equation of a line on a plane. 


The name of the equation (3.26) is due to the parameters a 
and b being the lengths of the segments [OX] and [OY] which 
the line intercepts on the coordinate axes. 


$4. A plane in the space. 


Assume that some plane œ in the space E is chosen and 

fixed. In order to study various equations determining this 

plane we choose some coordinate 

n system O, e1, e2, e3 in the space 

E. Then we can describe the 

points of this plane a by their 
radius vectors. 

1. Vectorial parametric 
equation of a plane in the spa- 
ce. Let’s denote through A some 
fixed point of the plane a (see 
Fig. 4.1) and denote through X 
some arbitrary point of this pla- 
ne. The position of the point X 
relative to the point A is marked 
Fig. 4.1 by the vector AX’ , while the po- 

sition of the point A is deter- 


ca 
mined by its radius vector rọ = OA. For this reason 
—> —> 
r=0X =ro+4X. (4.1) 


Let’s choose and fix some pair of non-collinear vectors a # b 
lying on the plane a. Such vectors constitute a basis on this 
plane (see Definition 17.1 from Chapter I). The vector AX’ is 
expanded in the basis of the vectors a and b: 


AX =a-t+b-r. (4.2) 
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Since X is an arbitrary point of the plane a, the numbers t and 
T in (4.2) are two variable parameters. Upon substituting (4.2) 
into the formula (4.1), we get the following equality: 


r=rota-t+b-r. (4.3) 


DEFINITION 4.1. The equality (4.3) is called the vectorial 
parametric equation of a plane in the space. The non-collinear 
vectors a and b in it are called directional vectors of a plane, 
while t and 7 are called parameters. The fixed vector rọ is the 
radius vector of an initial point. 


2. Coordinate parametric equation of a plane in the 
space. Let’s determine the vectors r, ro, a, b, in the vectorial 
parametric equation (4.3) through their coordinates: 


x £o Ay: by 
F=|yls ro=l vol, @=|/ay|], b=] byl; (4.4) 
z Zo az bz 


Due to (4.4) the equation (4.3) is written as three equations 


£ = To + az t + bzT, 
Y = Yo + ayt + byT, (4.5) 
z = zo + az t + bzT. 


DEFINITION 4.2. The equalities (4.5) are called the coordinate 
parametric equations of a plane in the space. The triples of 
constants dz, Qy, a, and bz, by, bz in these equations cannot be 
proportional to each other. 


3. Normal vectorial equation of a plane in the space. Let 
n Æ 0 be a vector perpendicular to the plane a (see Fig. 4.1). 
Let’s apply the scalar multiplication by the vector n to both sides 
of the equation (4.3). As a result we get 


(r,n) = (ro, n) + (a,n) t + (b, n)r7. (4.6) 
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But a L n and b L n. For this reason the second and the third 
terms in the right hand side of (4.6) vanish and the equation 
actually does not have the parameters t and 7. The resulting 
equation is usually written as 


(r —ro,n) = 0. (4.7) 


The scalar product of two constant vectors rọ and n is a numeric 
constant. If we denote D = (rọ, n), then the equation (4.7) can 
be written in the following form: 


(r,n) = D. (4.8) 


DEFINITION 4.3. Any one of the two equalities (4.7) and (4.8) 
is called the normal vectorial equation of a plane in the space. 
The constant vector n Æ 0 in these equations is called a normal 
vector of this plane. 


4. Canonical equation of a plane in the space. The vector 
product of two non-coplanar vectors a ¥ b lying on the plane 
a can be chosen for the normal vector n in (4.7). Substituting 
n = [a, b] into the equation (4.7), we get the relationship 


(r — ro, [a, b]) = 0. (4.9) 


Applying the definition of the mixed product (see formula (45.1) 
in Chapter I), the scalar product in the formula (4.9) can be 
transformed to a mixed product: 


(r — ro, a, b) = 0. (4.10) 


Let’s transform the equation (4.10) into a coordinate form. For 
this purpose we use the coordinate presentations of the vectors r, 
ro, a, and b taken from (4.4). Applying the formula (50.8) from 
Chapter I) and taking into account the fact that the oriented 
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volume of a basis (e1, €2,€3) is nonzero, we derive 


zT — To Y-VYo Z= 40 
ag Qy az = 0. (4.11) 


If we use not the equation (4.7), but the equation (4.8), then 
instead of the equation (4.10) we get the following relationship: 
(r,a,b) = D. (4.12) 


In the coordinate form the relationship (4.12) is written as 


£ Y 2 
s y agl =D, (4.13) 
bs by bz 


DEFINITION 4.4. Any one of the two equalities (4.11) and 
(4.13) is called the canonical equation of a plane in the space. The 
triples of constants az, ay, az and bz, by, bz in these equations 
should not be proportional to each other. 


DEFINITION 4.5. The equation (4.10) and the equation (4.12), 
where a }{ b, are called the vectorial forms of the canonical 
equation of a plane in the space. 


5. The equation of a plane passing through three given 
points. Assume that three points A, B, and C in the space E 
nit lying on a straight line are given. We write their coordinates 


A = A(x0, Yo; 20); 

B= B(21, 41, 21), (4.14) 
C = C (£2, Y2, 22). 
—s 


—> 
The vectors a = AB and b = AC can be chosen for the 
directional vectors of a plane passing through the points A, B, 
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and C. Then from the formulas (4.14) we derive 


Ge £1 — T0 bz T2 — XO 
a=|/ay |} =l ¥1—Yo ||, b= by = | Y2 — Yo ||- (4.15) 
Qz £1 — 20 bz Z2 — 20 


Due to (4.15) the equation (4.11) can be written as 


t—XO Y-Y #40 
zı — to Yi-Yo 41-20) =0. (4.16) 


T2 — To Y2-Yo %2— Z0 


If we denote through rı and rə the radius vectors of the points 
B and C from (4.14), then (4.15) can be written as 


a=r,— ro, b = rə — ro. (4.17) 


Substituting (4.17) into the equation (4.10), we get 


(r — ro, rı — ro, r2 — ro) = 0. (4.18) 
DEFINITION 4.6. The equality (4.16) is called the equation of 
a plane passing through three given points. 


DEFINITION 4.7. The equality (4.18) is called the vectorial 
form of the equation of a plane passing through three given points. 


6. General equation of a plane in the space. Let’s apply 
the formula (29.8) from Chapter I in order to calculate the scalar 
product (r, n) in the equation (4.8). This yields 


3 3 
(r,n) = Zar n! gij, (4.19) 
i=1 j=1 


where gij are the components of the Gram matrix for the basis 
€1, €2, €3 (see Fig. 4.1). In Fig. 4.1 the basis e1, e2, e3 is 
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drawn to be rectangular. However, in general case it could be 
skew-angular as well. Let’s introduce the notations 


3 
j=l 


The quantities n', n?, and n? in (4.19) and (4.20) are the 
coordinates of the normal vector n (see Fig. 4.1). 


DEFINITION 4.8. The quantities n1, n2, and ng produced from 
the coordinates of the normal vector n by mens of the formula 
(4.20) are called the covariant coordinates of the vector n. 


Taking into account the notations (4.20), the formula (4.19) is 
written in the following simplified form: 


2 


(rn) = Sor. (4.21) 


i=1 
Let’s recall the previous notations (4.4) and introduce new ones: 
A=n, B = ng, C = ng. (4.22) 

Due to (4.4), (4.21), and (4.22) the equation (4.8) is written as 
Az+By+Cy-D=0. (4.23) 
DEFINITION 4.9. The equation (4.23) is called the general 


equation of a plane in the space. 


7. Triple intercept equation of a plane in the space. Let’s 
consider a plane in the space that does not pass through the 
origin and intersects with each of the three coordinate axes. 
These conditions mean that D #40, A # 0, B40, and C #0 in 
(4.23). Through X, Y, and Z in Fig. 4.2 below we denote three 
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intercept points of the plane. Here are the coordinates of these 
three intercept points X, Y, and Z produced by our plane: 


X = X(a,0,0), 
Y = Y (0,b,0), (4.24) 
Z =Y(0,0,¢). 


The quantities a, b, and c in 
(4.24) are expressed through the 
constant parameters A, B, C, 
and D of the equation (4.23) by 
means of the formulas 


a = D/A, 
b= D/B, (4.25) 
Fig. 4.2 c= D/C. 


The equation of the plane (4.23) itself can be written as 


g y z 
— + — + — = 1 4.26 
D/A’ D/B D/C eee) 
If we take into account (4.25), the equality (4.26) turns to 
T. Y, zZ 
-+-=4-=1. 4.27 
a + b = ë een) 


DEFINITION 4.10. The equality (4.27) is called the triple in- 
tercept equation of a plane in the space. 


§5. A straight line in the space. 


Assume that some straight line a in the space E is chosen 
and fixed. In order to study various equations determining this 
line we choose some coordinate system O, e1, e2, e3 in the space 
E. Then we can describe the points of the line by their radius 
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vectors relative to the origin O. 

1. Vectorial parametric equation of a line in the space. 
Let’s denote through A some fixed point on the line (see Fig. 5.1) 
and denote through X an arbi- 
z trary point of this line. The 
position of the point X relative 
to the point A is marked by the 
vector AX’ , while the position of 
the point A itself is determined 
. R A 
by its radius vector ro = OA. 

Therefore we have 


X. 


r=r+A (5.1) 


Let’s choose and fix some non- 
Fig. 5.1 zero vector a Æ 0 directed along 
the line in question. The vector 
AX’ is expressed through the vector a by means of the formula 


AX =a-t. (5.2) 
From the formulas (5.1) and (5.2) we immediately derive 
r=rota-t. (5.3) 


DEFINITION 5.1. The equality (5.3) is called the vectorial 
parametric equation of the line in the space. The constant vector 
a # 0 in this equation is called a directional vector of the line, 
while t is a parameter. The constant vector ro is the radius vector 
of an initial point. 


Each particular value of the parameter t corresponds to some 
definite point on the line. The initial point A with the radius 
vector ro is associated with the value t = 0. 
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2. Coordinate parametric equations of a line in the space. 
Let’s determine the vectors r, ro, and a in the vectorial paramet- 
ric equation (5.3) through their coordinates: 


x To Ay 
r=|ly||, ro = || yo ||, a= || ay ||. (5.4) 
zZ Zo az 


Due to (5.4) the equation (5.3) is written as three equations: 


g= To + ürt 
Y = Yo + ayt, (5.5) 
z = zo + azt. 


DEFINITION 5.2. The equalities (5.5) are called the coordinate 
parametric equations of a line in the space. The constants az, dy, 
az in them cannot vanish simultaneously 


3. Vectorial equation of a line in the space. Let’s apply 
the vectorial multiplication by the vector a to both sides of the 
equation (5.3). As a result we get 


[r,a] = [ro, a] + [a, a] t. (5.6) 


Due to the item 4 of the theorem 39.1 in Chapter I the vector 
product [a,a] in (5.6) is equal to zero. For this reason the 
equation (5.6) actually does not contain the parameter t. This 
equation is usually written as follows: 


[r — ro, a] = 0. (5.7) 


The vector product of two constant vectors ro and a is a 
constant vector. If we denote this vector b = [ro,aj, then the 
equation of the line (5.7) can be written as 


[r,a] = b, where bla. (5.8) 
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DEFINITION 5.3. Any one of the two equalities (5.7) and (5.8) 
is called the vectorial’ equation of a line in the space. The 
constant vector b in the equation (5.8) should be perpendicular 
to the directional vector a. 


4. Canonical equation of a line in the space. Let’s consider 
the case where az # 0, ay #0, and a, Æ 0 in the equations (5.5). 
Then the parameter t can be expressed through x, y, and z: 

T — To Yy — Yo Z — £0 


t= t = 2 j=, 5.9 
ae ~ — (5.9) 


TO Y—Yo_ *%— %0 
Ay Qy az 


(5.10) 


If a, = 0, then instead of the first equation (5.9) from (5.5) we 
derive x = zo. Therefore instead of (5.10) we write 


a Y— Yo %— 40 
_ “äg ag 
y z 


(5.11) 
If a, = 0, then instead of the second equation (5.9) from (5.5) we 
derive y = yo. Therefore instead of (5.10) we write 


T= £0 Z= 20 
Y = Yo, = . 


5.12 
2. = (5.12) 


If a, = 0, then instead of the third equation (5.9) from (5.5) we 
derivez = zo. Therefore instead of (5.10) we write 
T — To yY — Yo 


Z = %0; Tas = ene (5.13) 
T y 


1 The terms «vectorial equation» and «vectorial parametric equation» are 
often confused and the term «vector equation» is misapplied to (5.3). 
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If az = 0 and a, = 0, then instead of (5.10) we write 


= T0, Y = Yo- (5.14) 


If ag = 0 and a, = 0, then instead of (5.10) we write 
T= ti; z= žo: (5.15) 


If ay = 0 and a, = 0, then instead of (5.10) we write 


Y = Yo; Z = Zy (5.16) 


DEFINITION 5.4. Any one of the seven pairs of equations 
(5.10), (5.11), (5.12), (5.13), (5.14), (5.15), and (5.16) is called 
the canonical equation of a line in the space. 


5. The equation of a line passing through two given 
points in the space. Assume that two distinct points A # B in 
the space are given. We write their coordinates 


A = A(20, Yo, 20), B = hig Wier): (5.17) 


—> 

The vector a = AB can be used for the directional vector of the 
line passing through the points A and B in (5.17). Then from 
(5.17) we derive the coordinates of a: 


Ay Ti = Xo 
a= || ay || = || yı — yo || - (5.18) 
az 21 — Zo 


Due to (5.18) the equations (5.10) can be written as 


Zoto o YT a Ta (5.19) 
Tı — To Yı — Yo Z1 — 20 
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The equations (5.19) correspond to the case where the inequali- 


ties zı Æ £o, y1 # Yo, and zı Æ 2 are fulfilled. 


If x; = xo, then instead of (5.19) we write the equations 


_ _ Y—¥o _ *%— %0 
p= o= 215 SS SS 
Yı — Yo 21 — 20 


If y1 = yo, then instead of (5.19) we write the equations 


T= LO Z= £0 
Y = Yo 5 Y1, a 
Tı — To 21 — 20 
If z1 = zo, then instead of (5.19) we write the equations 


z— zo Y-Y 


Z = 20 = 21; = A 
Tı — To Yı — Yo 


If zı = zo and yı = yo, then instead of (5.19) we write 


T = To = T1, y= Yoa = Yi- 
If zı = zo and z1 = 2, then instead of (5.19) we write 
T = To = Tı, 2 = 20 = 21. 


If y1 = yo and z; = Zo, then instead of (5.19) we write 


Y = yo 5 Yı, Zz = 20 = 21. 


(5.20) 


(5.21) 


(5.22) 


(5.23) 


(5.24) 


(5.25) 


The conditions 7; = £o, Yı = Yo, and zı = zg cannot be fulfilled 


simultaneously since A # B. 


DEFINITION 5.5. Any one of the seven pairs of equalities 
(5.19), (5.20), (5.21), (5.22), (5.23), (5.24), and (5.25) is called the 
equation of a line passing through two given points A and B with 


the coordinates (5.17). 
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6. The equation of a line in the space as the intersection 
of two planes. In the vectorial form the equations of two 
intersecting planes can be written as (4.8): 


(r, Hy) = Dı, (r, n2) = Də. (5.26) 


For the planes given by the equations (5.26) do actually intersect 
their normal vectors should be non-parallel: nı } no. 

In the coordinate form the equations of two intersecting planes 
can be written as (4.23): 


Aye Bıy +Cıy-— Dı =0, 
Ag & Boy Coy Dz = 0. 


(5.27) 


DEFINITION 5.6. Any one of the two pairs of equalities (5.26) 
and (5.27) is called the equation of a line in the space obtained 
as the intersection of two planes. 


§6. Ellipse. Canonical equation of an ellipse. 


DEFINITION 6.1. An ellipse is a set of points on some plane 
the sum of distances from each of which to some fixed points Fi 
and F> of this plane is a constant which is the same for all points 
of the set. The points F, and F> are called the foci of the ellipse. 


Assume that an ellipse with 
the foci Fı and Fə is given. 
Let’s draw the line connecting 
the points Fı and F> and choose 
this line for the z-axis of a co- 
ordinate system. Let’s denote 
through O the midpoint of the 
segment |F; F>] and choose it for 
the origin. We choose the second 
coordinate axis (the y-axis) to 
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be perpendicular to the x-axis on the ellipse plane (see Fig. 6.1). 
We choose the unity scales along the axes. This means that the 
basis of the coordinate system constructed is orthonormal. 

Let M = M(x,y) be some arbitrary point of the ellipse. 
According to the definition 6.1 the sum |MF,| + |M Fy] is equal 
to a constant which does not depend on the position of M on the 
ellipse. Let’s denote this constant through 2a and write 


|MF,| + |MFy| = 2a. (6.1) 


The length of the segment |F; Fə] is also a constant. Let’s 
denote this constant through 2c. This yields the relationships 


|FO| = |O F>| =C, |F Fol = 2c. (6.2) 


From the triangle inequality |F\F2| < |MF| + |MF)| we derive 
the following inequality for the constants c and a: 


c<Sa. (6.3) 


The case c = a in (6.3) corresponds to a degenerate ellipse. In 
this case the triangle inequality |F Fo| < |MFiı| + |M Fə| turns 
to the equality |F Fo| = |M F| + |M Fə|, while the MFF itself 
collapses to the segment |F; F>]. Since M € |F; F>], a degenerate 
ellipse with the foci Fı and F> is the segment [Fi F2]. The case 
of a degenerate ellipse is usually excluded by setting 


Us ex a. (6.4) 


The formulas (6.2) determine the foci F} and F> of the ellipse 
in the coordinate system we have chosen: 


Fi = Fi (—c,0), Fy = F(c, 0). (6.5) 


Having determined the coordinates of the points F, and Fy and 
knowing the coordinates of the point M = M(x,y), we derive the 
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following relationships for the segments [M F;] and [M F>]: 


|MFi| = Vy? + (x+¢)?, 


6.6 
|MF2| = Vy? + (@— ¢)?. — 


Derivation of the canonical equation of an ellipse. Let’s 
substitute (6.6) into the equality (6.1) defining the ellipse: 


(c+ c)?+ /y24+(a4-—c)? = 2a. (6.7) 


Then we move one of the square roots to the right hand side of 
the formula (6.7). As a result we derive 


Vy? + (atc)? = 2a — yy? +(x- c). (6.8) 
Let’s square both sides of the equality (6.8): 


y? + (z +c) = 4a — 


6.9 
— 4a yy? + (z -— c)? +y? + (2 -— c}. (6.9) 


Upon expanding brackets and collecting similar terms the equal- 
ity (6.9) can be written in the following form: 


4a s/y? + (z —c)? = 4a” —4Aze. (6.10) 


Let’s cancel the factor four in (6.10) and square both sides of this 
equality. As a result we get the formula 


a? (yY? + (z —c)*) =at - 2a? reH r? e. (6.11) 


Upon expanding brackets and recollecting similar terms the 
equality (6.11) can be written in the following form: 


r’ (a? —¢*) +y a = a? (a? — ee). (6.12) 
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Both sides of the equality (6.12) contain the quantity a? — c?. 


Due to the inequality (6.4) this quantity is positive. For this 
reason it can be written as the square of some number b > 0: 


-e =b. (6.13) 
Due to (6.13) the equality (6.12) can be written as 
a? b? + y? a? a (6.14) 


Since b > 0 and a > 0 (see the inequalities (6.4)), the equality 
(6.14) transforms to the following one: 


Z +5 l. (6.15) 


DEFINITION 6.2. The equality (6.15) is called the canonical 
equation of an ellipse. 


THEOREM 6.1. For each point M = M(x,y) of the ellipse de- 
termined by the initial equation (6.7) its coordinates obey the 
canonical equation (6.15). 


The proof of the theorem 6.1 consists in the above calculations 
that lead to the canonical equation of an ellipse (6.15). This 
canonical equation yields the following important inequalities: 


|z| <a, ly] < b. (6.16) 


THEOREM 6.2. The canonical equation of an ellipse (6.15) is 
equivalent to the initial equation (6.7). 


PROOF. In order to prove the theorem 6.2 we transform the 
expressions (6.6) relying on (6.15). From (6.15) we derive 


Teb, (6.17) 
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Substituting (6.17) into the first formula (6.6), we get 


p2 
|MFi| = 4/0? -— 2? +2? +22c +e? [= 
a 


2b 
=,/° z 2? lee (b? +e). 
a 


(6.18) 


Now we take into account the relationship (6.13) and write the 
equality (6.18) in the following form: 


2 2 2 
IMF\| =a? +20c+Sa2=/(5 =) (6.19) 
a a 


Upon calculating the square root the formula (6.19) yields 


la? + ca 


|MF,| = (6.20) 


From the inequalities (6.4) and (6.16) we derive a? + cx > 0. 
Therefore the modulus signs in (6.20) can be omitted: 


2 
e a ce (6.21) 
a a 


In the case of the second formula (6.6) the considerations similar 
to the above ones yield the following result: 
a? — cg Cx 


|MF| = sgat, (6.22) 
a a 


Now it is easy to see that the equation (6.7) written as |MF\| + 
|MF,| = 2a due to (6.6) is immediate from (6.21) and (6.22). 
The theorem 6.2 is proved. 


Let’s consider again the inequalities (6.16). The coordinates of 
any point M of the ellipse obey these inequalities. The first of the 
inequalities (6.16) turns to an equality if M coincides with A or if 
M coincides with C (see Fig. 6.1). The second inequality (6.16) 
turns to an equality if M coincides with B or if M coincides D. 
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DEFINITION 6.3. The points A, B, C, and D in Fig. 6.1 are 
called the vertices of an ellipse. The segments [AC] and [BD] 
are called the azes of an ellipse, while the segments [OA], [OB], 
[OC], and [OD] are called its semiazes. 


The constants a, b, and c obey the relationship (6.13). From 
this relationship and from the inequalities (6.4) we derive 


0<b<a. (6.23) 


DEFINITION 6.4. Due to the inequalities (6.23) the semiaxis 
[OA] in Fig. 6.1 is called the major semiazis of the ellipse, while 
the semiaxis [OB] is called its minor semiazis. 


DEFINITION 6.5. A coordinate system O, e1, €2 with an or- 
thonormal basis e1, e2 where an ellipse is given by its canonical 
equation (6.15) and where the inequalities (6.23) are fulfilled is 
called a canonical coordinate system of this ellipse. 


$87. The eccentricity and directrices of 
an ellipse. The property of directrices. 


The shape and sizes of an ellipse are determined by two 
constants a and b in its canonical equation (6.15). Due to the 
relationship (6.13) the constant b can be expressed through the 
constant c. Multiplying both constants a and c by the same 
number, we change the sizes of an ellipse, but do not change its 
shape. The ratio of these two constants 


Pem =. (7.1) 


is responsible for the shape of an ellipse. 


DEFINITION 7.1. The quantity £ defined by the relationship 
(7.1), where a is the major semiaxis and c is the half of the 
interfocal distance, is called the eccentricity of an ellipse. 
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The eccentricity (7.1) is used in order to define one more 
numeric parameter of an ellipse. It is usually denoted through d: 


daa . (7.2) 


DEFINITION 7.2. On the plane of an ellipse there are two lines 
parallel to its minor axis and placed at the distance d given by 
the formula (7.2) from its center. These lines are called directrices 

of an ellipse. 


M Each ellipse has two foci and 

two directrices. Each directrix 

47 | ™ a has the corresponding focus of 
it. This is that of two foci which 

is more close to the directrix in 

question. Let M = M(x,y) be 


some arbitrary point of an el- 
lipse. Let’s connect this point 
Fig. 7.1 with the left focus of the ellipse 

Fı and drop the perpendicular 

from it to the left directrix of the ellipse. Let’s denote through 


H, the base of such a perpendicular and calculate its length: 


|MH;| = |x — (—d)| = |d + a. (7.3) 
Taking into account (7.2), the formula (7.3) can be brought to 


_ |@+ee| 


\MHy| = Es +2| (7.4) 


c 

The length of the segment MF, was already calculated above. 
Initially it was given by one of the formulas (6.6), but later the 
more simple expression (6.20) was derived for it: 


2 
|MF,| = lat + eal (7.5) 
a 
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If we divide (7.5) by (7.4), we obtain the following relationship: 


|M F| Cc 
Soa. 7.6 
MH a ee} 


The point M can change its position on the ellipse. Then 
the numerator and the denominator of the fraction (7.6) are 
changed, but its value remains unchanged. This fact is known as 
the property of directrices. 


THEOREM 7.1. The ratio of the distance from some arbitrary 
point M of an ellipse to its focus and the distance from this point 
to the corresponding directrix is a constant which is equal to the 
eccentricity of the ellipse. 


§ 8. The equation of a tangent line to an ellipse. 


Let’s consider an ellipse given by its canonical equation (6.15) 
in its canonical coordinate system (see Definition 6.5). Let’s draw 
a tangent line to this ellipse and denote through M = M (zo, yo) 
its tangency point (see Fig. 8.1). 
Our goal is to write the equation 
of a tangent line to an ellipse. 

An ellipse is a curve com- 
posed by two halves — the up- 
per half and the lower half. Any 
one of these two halves can be 
treated as a graph of a function 


y = f(z) (8.1) 


Fig. 8.1 


with the domain (—a,a). The 
equation of a tangent line to the graph of a function is given by 
the following well-known formula (see [9]): 


Yy = yo + f'(xo) (x — 20). (8.2) 
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In order to apply the formula (8.2) to an ellipse we need to 
calculate the derivative of the function (8.1). Let’s substitute the 
expression (8.1) for y into the formula (6.15): 


z y Ue 


The equality (8.3) is fulfilled identically in x. Let’s differentiate 
the equality (8.3) with respect to x. This yields 


=i (8.3) 


— + 1 = 0. (8.4) 


Let’s apply the formula (8.4) for to calculate the derivative f'(x): 
b? x 
a? f(x) 


In order to substitute (8.5) into the equation (8.2) we change x 
for zo and f(x) for f(x) = yo. As a result we get 


f(z) = (8.5) 


(8.6) 


Let’s substitute (8.6) into the equation of the tangent line 
(8.2). This yields the following relationship 


b r 
y = yo — -z (z — 20). (8.7) 
a” Yo 


Eliminating the denominator, we write the equality (8.7) as 
a? yY yo +b? zzo =a? yg +b? 22. (8.8) 
Now let’s divide both sides of the equality (8.8) by a? b?: 


TTo Yyo zre y (8.9) 
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Note that the point M = M(zo, yo) is on the ellipse. Therefore 
its coordinates satisfy the equation (6.15): 


Tpi (8.10) 


Taking into account (8.10), we can transform (8.9) to 


TTo  Yyo 


This is the required equation of a tangent line to an ellipse. 


THEOREM 8.1. For an ellipse determined by its canonical equa- 
tion (6.15) its tangent line that touches this ellipse at the point 
M = M(zo,yo) is given by the equation (8.11). 


The equation (8.11) is a particular case of the equation (3.22) 
where the constants A, B, and D are given by the formulas 


To 
a?’ 


_ Yo 


A= — Fa 


B D=1. (8.12) 
According to the definition 3.6 and the formulas (3.21) the con- 
stants A and B in (8.12) are the covariant components of the 
normal vector for the tangent line to an ellipse. The tangent 
line equation (8.11) is written in a canonical coordinate system 
of an ellipse. The basis of such a coordinate system is orthonor- 
mal. Therefore the formula (3.19) and the formula (32.4) from 
Chapter I yield the following relationships: 


A=n,=n', B=m =n. (8.13) 


THEOREM 8.2. The quantities A and B in (8.12) are the co- 
ordinates of the normal vector n for the tangent line to an ellipse 
which is given by the equation (8.11). 


The relationships (8.13) prove the theorem 8.2. 
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§9. Focal property of an ellipse. 


The term focus is well known in optics. It means a point where 
light rays converge upon refracting in lenses or upon reflecting in 
curved mirrors. In the case of an ellipse let’s assume that it is 
manufactured of a thin strip of some flexible material and assume 
that its inner surface is covered with a light reflecting layer. For 
such a materialized ellipse one can formulate the following fo- 
cal property. 


THEOREM 9.1. A light ray emitted from one of the foci of an 
ellipse upon reflecting on its inner surface passes through the other 
focus of this ellipse. 


THEOREM 9.2. The perpendicular to a tangent line of an el- 
lipse drawn at the tangency point is a bisector in the triangle 
composed by the tangency point and two foci of the ellipse. 


The theorem 9.2 is a geomet- 
ric version of the theorem 9.1. 
These theorems are equivalent 
due to the reflection law say- 
ing that the angle of reflection is 
equal to the angle of incidence. 


PROOF OF THE THEOREM 9.2. 
Let’s choose an arbitrary point 
M = M(ao,yo) on the ellipse 
and draw the tangent line to the 
ellipse through this point as shown in Fig. 9.1. Then we draw 
the perpendicular [MN] to the tangent line through the point 
M. The segment [MN] is directed along the normal vector of 
the tangent line. In order to prove that this segment is a bisector 
of the triangle Fı M Fù it is sufficient to prove the equality 


Fig. 9.1 


cos(F{ MN) = cos( F, MN). (9.1) 


The cosine equality (9.1) is equivalent to the following equality 
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for the scalar products of vectors: 


—> —> 
(MF;,n) _ (M F2,n) (9 2) 
|M Fi | MF] ` 


The coordinates of the points F, and F in a canonical co- 
ordinate system of the ellipse are known (see formulas (6.5)). 
The coordinates of the point M = M(z£o, yo) are also known. 


Therefore we can find the coordinates of the vectors MF, and 
M Fy, used in the above formula (9.2): 


, MF, = |17” (9.3) 


—Y¥o 


aad == tý 
—Y¥o 


The coordinates of the normal vector n of the tangent line in 
Fig. 9.1 are given by the formulas (8.12) and (8.13): 


n= ; (9.4) 


Using (9.3) and (9.4), we apply the formula (33.3) from Chapter I 
for calculating the scalar products in (9.2): 


2 2 2 2 
——> , —€%9—-2 Yð —C® 2% Y 
ae a a b’ 
2 2 2 2 (9.5) 
—> , _ Cto— ző Yð CTo 2 Yô 
Mine a e ae e 


The point M = M(zo,yo) lies on the ellipse. Therefore its 
coordinates should satisfy the equation of the ellipse (6.15): 


2y], (9.6) 
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Due to (9.6) the formulas (9.5) simplify and take the form 


2 
——> = 
(MF;,n) = ae 
“ : . (9.7) 
——> C £o af — c £o 


In order to calculate the denominators in the formula (9.2) we 
use the formulas (6.21) and (6.22). In this case, when applied to 
the point M = M (xo, yo), they yield 


a? + Czo C To 


2 
a — 
|M F| = —, |M Fo| = ———— (9.8) 
a a 


From the formulas (9.7) and (9.8) we easily derive the equalities 


(MF;,n) = 1 (M F2,n) a. 
MF] a’ IMF, a 


which proves the equality (9.2). As a result the theorem 9.2 and 
the theorem 9.1 equivalent to it both are proved. 


§10. Hyperbola. Canonical equation of a hyperbola. 


DEFINITION 10.1. A hyperbola is a set of points on some plane 
the modulus of the difference of distances from each of which to 
some fixed points Fı and Fp of this plane is a constant which 
is the same for all points of the set. The points Fı and Fp are 
called the foci of the hyperbola. 


Assume that a hyperbola with the foci F, and Fə is given. 
Let’s draw the line connecting the points Ff and Fp and choose 
this line for the z-axis of a coordinate system. Let’s denote 
through O the midpoint of the segment [FF] and choose it 
for the origin. We choose the second coordinate axis (the y- 
axis) to be perpendicular to the x-axis on the hyperbola plane 


§10. HYPERBOLA. 173 


y (see Fig. 10.1). We choose the 
unity scales along the axes. This 
M means that the basis of the co- 
ordinate system constructed is 
orthonormal. 
FR l/c OA\F Let M = M(x,y) be some ar- 
bitrary point of the hyperbola 
in Fig. 10.1. According to the 
definition of a hyperbola 10.1 
the modulus of the difference 
Fig. 10.1 || MFi| — |MFə|| is a constant 
which does not depend on a lo- 
cation of the point M on the hyperbola. We denote this constant 
through 2a and write the formula 


|| MF, | — |MF|| = 2a. (10.1) 
According to (10.1) the points of the hyperbola are subdivided 
into two subsets. For the points of one of these two subsets the 
condition (10.1) is written as 


|MF,| —|MFy| = 2a. (10.2) 


These points constitute the right branch of the hyperbola. For 
the points of the other subset the condition (10.1) is written as 


|MF2| — |M F| = 2a. (10.3) 
These points constitute the left branch of the hyperbola. 


The length of the segment [F) F 2] is also a constant. Let’s 
denote this constant through 2c. This yields 


|FO| = [OF] = C, |F Fol = 2G: (10.4) 


From the triangle inequality |F\F2| > || MF; | — |MF)||, from 
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(10.2), from (10.3), and from (10.4) we derive 
ca. (10.5) 


The case c = a in (10.5) corresponds to a degenerate hyper- 
bola. In this case the inequality |F; Fə| > || MF; | — |M F>|| 
turns to the equality |F\F2| = ||MF\| — |MF || which dis- 
tinguishes two subcases. In the first subcase the equality 
|F F2| = ||MF;| — |M Fz|| is written as 


|FiFo| = |MF;]| — |M F3]. (10.6) 


The equality (10.6) means that the triangle F} M F» collapses into 
the segment [F M], i.e. the point M lies on the ray going along 
the z-axis to the right from the point F> (see Fig. 10.1). 

In the second subcase of the case c = a the equality |F; Fo| = 
= ||MF;| — |M F|] is written as follows 


[Fi P| = |M F>| — |M F]. (10.7) 


The equality (10.7) means that the triangle F} M F> collapses into 
the segment [F M], i.e. the point M lies on the ray going along 
the z-axis to the left from the point Fı (see Fig. 10.1). 

As we see considering the above two subcases, if c = a the 
degenerate hyperbola is the union of two non-intersecting rays 
lying on the z-axis and being opposite to each other. 

Another form of a degenerate hyperbola arises if a = 0. In this 
case the branches of the hyperbola straighten and, gluing with 
each other, lie on the y-axis. 

Both cases of degenerate hyperbolas are usually excluded from 
the consideration by means of the following inequalities: 


gma0: (10.8) 


The formulas (10.4) determine the coordinates of the foci Fi 
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and F of our hyperbola in the chosen coordinate system: 
Fi = Fı(—c,0), Fy = F(c,0). (10.9) 


Knowing the coordinates of the points F, and F> and knowing 
the coordinates of the point M = M(x,y), we write the formulas 


|MFi| = Vy? + (x + ¢)?, 


(10.10) 
|MF2| = Vy? + (z — c}. 

Derivation of the canonical equation of a hyperbola. 
As we have seen above, the equality (10.1) defining a hyperbola 
breaks into two equalities (10.2) and (10.3) corresponding to the 
right and left branches of the hyperbola. Let’s unite them back 
into a single equality of the form 


|MF,| —|MFy| =+2a. (10.11) 


Let’s substitute the formulas (10.10) into the equality (10.11): 


y2 + (x +c)? — yy? + (z -— c)? =+2a. (10.12) 


Then we move one of the square roots to the right hand side of 
the formula (10.12). As a result we derive 


y? + (x+) =+2a4 yy? + (x4-c)?. (10.13) 


Squaring both sides of the equality (10.13), we get 


y? + (z +c)? = 4a? + 


10.14 
-4a y2 + (z —c)? +y? 4+ (r-e). l ) 


Upon expanding brackets and collecting similar terms the equal- 
ity (10.14) can be written in the following form: 


F4ar/y? + (z —c)? = 4a? — 4 xc. (10.15) 
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Let’s cancel the factor four in (10.15) and square both sides of 
this equality. As a result we get the formula 


a? (Y? + (w@—c)”) = a4 -2e? wet 27’. (10.16) 


Upon expanding brackets and recollecting similar terms the 
equality (10.16) can be written in the following form: 


x” (a® —¢*) + y? a? = a? (a? — ec”). (10.17) 


An attentive reader can note that the above calculations are 
almost literally the same as the corresponding calculations for 
the case of an ellipse. As for the resulting formula (10.17), it 
coincides exactly with the formula (6.12). But, nevertheless, 
there is a difference. It consists in the inequalities (10.8), which 
are different from the inequalities (6.4) for an ellipse. 

Both sides of the equality (10.17) comprises the quantity 
a? — c*. Due to the inequalities (10.8) this quantity is negative. 
For this reason the quantity a? — c? can be written as the square 
of some positive quantity b > 0 taken with the minus sign: 


a -e = b’. (10.18) 
Due to (10.18) the equality (10.17) can be written as 
ap bP +y a = a o’. (10.19) 


Since b > 0 and a > 0 (see inequalities (10.8)), the above equality 
(10.19) transforms to the following one: 


a (10.20) 


DEFINITION 10.2. The equality (10.20) is called the canonical 
equation of a hyperbola. 
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THEOREM 10.1. For each point M(x,y) of the hyperbola de- 
termined by the initial equation (10.12) its coordinates satisfy the 
canonical equation (10.20). 


The above derivation of the canonical equation (10.20) of a 
hyperbola proves the theorem 10.1. The canonical equation leads 
(10.20) to the following important inequality: 


|z| > a. (10.21) 


THEOREM 10.2. The canonical equation of a hyperbola (10.20) 
is equivalent to the initial equation (10.12). 


Proor. The proof of the theorem 10.2 is analogous to the 
proof of the theorem 6.2. In order to prove the theorem 10.2 we 
calculate the expressions (10.10) relying on the equation (10.20). 
The equation (10.20) itself can be written as follows: 

b2 


E 3 22 
pan” — b°. (10.22) 


Substituting (10.22) into the first formula (10.10), we get 


p2 
|MFi| = 4/5 2? -b Hr? +2rce e? H= 
a 


2192 
Te = x2+2ac+ (c? — 62). 
a 


Now we take into account the relationship (10.18) and write the 
equality (10.23) in the following form: 


2 2 2 
IMF,| = 4/02 +2rc+ Š r? = (=) ; (10.24) 
a? a 


Upon calculating the square root the formula (10.24) yields 


(10.23) 


ja? + c2| 


|MF,| = (10.25) 
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From the inequalities (10.8) and (10.21) we derive |cz| > a?. 
Therefore the formula (10.25) can be written as follows: 


7 2 
airije ee NE calla, (10.26) 


a a 
In the case of the second formula (10.10) the considerations, 
analogous to the above ones, yield the following result: 


2 


c|z|— sign(x)a?  c|z]| 


|M F| = — sign(x) a. (10.27) 


a 


Let’s subtract the equality (10.27) from the equality (10.26). 
Then we get the following relationships: 


|MF;,| —|MF)| = 2 sign(z)a = +2a. (10.28) 


The plus sign in (10.28) corresponds to the case x > 0, which 
corresponds to the right branch of the hyperbola in Fig. 10.1. The 
minus sign corresponds to the left branch of the hyperbola. Due 
to what was said the equality (10.28) is equivalent to the equality 
(10.11), which in turn is equivalent to the equality (10.12). The 
theorem 10.2 is proved. 


Let’s consider again the inequality (10.21) which should be 
fulfilled for the x-coordinate of any point M on the hyperbola. 
The inequality (10.21) turns to an equality if M coincides with 
A or if M coincides with C (see Fig. 10.1). 


DEFINITION 10.3. The points A and C in Fig. 10.1 are called 
the vertices of the hyperbola. The segment [AC] is called 
the transverse axis or the real axis of the hyperbola, while the 
segments [OA] and [OC] are called its real semiazes. 


The constant a in the equation of the hyperbola (10.20) is 
the length of the segment [OA] in Fig. 10.1 (the length of the 
real semiaxis of a hyperbola). As for the constant b, there is no 
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segment of the length b in Fig. 10.1. For this reason the constant 
b is called the length of the imaginary semiazis of a hyperbola, 
i.e. the semiaxis which does not actually exist. 


DEFINITION 10.4. A coordinate system O, e1, e2 with an or- 
thonormal basis e1, €2 where a hyperbola is given by its canon- 
ical equation (10.20) is called a canonical coordinate system of 
this hyperbola. 


§11. The eccentricity and directrices of a 
hyperbola. The property of directrices. 


The shape and sizes of a hyperbola are determined by two 
constants a and 6 in its canonical equation (10.20).. Due to the 
relationship (10.18) the constant b can be expressed through the 
constant c. Multiplying both constants a and c by the same 
number, we change the sizes of a hyperbola, but do not change 
its shape. The ratio of these two constants 


Cc 
==. 11.1 
a ( ) 


is responsible for the shape of a hyperbola. 


DEFINITION 11.1. The quantity £ defined by the relationship 
(11.1), where a is the real semiaxis and c is the half of the 
interfocal distance, is called the eccentricity of a hyperbola. 


The eccentricity (11.1) is used in order to define one more 
parameter of a hyperbola. It is usually denoted through d: 


a 
d=-=—. 11.2 
z ( ) 


DEFINITION 11.2. On the plane of a hyperbola there are two 
lines perpendicular to its real axis and placed at the distance d 
given by the formula (7.2) from its center. These lines are called 
directrices of a hyperbola. 


180 CHAPTER II. GEOMETRY OF LINES AND SURFACES. 


Each hyperbola has two foci 
and two directrices. Each direc- 
trix has the corresponding focus 
of it. This is that of the two foci 
which is more close to the direc- 
trix in question. Let M(x,y) be 
some arbitrary point of a hyper- 
bola. Let’s connect this point 
with the left focus of the hyper- 
bola F; and drop the perpendic- 
ular from it to the left directrix 

Fig. 11.1 of the hyperbola. Let’s denote 
through Hı the base of such a 
perpendicular and calculate its length |M H,|: 


|M | = |x — (—d)| = |d + z|. (11.3) 
Taking into account (11.2), the formula (11.3) can be brought to 


_ |a+ez| 


az 
\MHy| = i +| (11.4) 
C 


C 

The length of the segment MF, was already calculated above. 
Initially it was given by one of the formulas (10.10), but later the 
more simple expression (10.25) was derived for it: 


la? + c2| 


|MF,| = (11.5) 


If we divide (11.5) by (11.4), we obtain the following relationship: 


IMFi] € 
EELS 11.6 
MHA] a mg 


The point M can change its position on the hyperbola. Then 
the numerator and the denominator of the fraction (11.6) are 
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changed, but its value remains unchanged. This fact is known as 
the property of directrices. 


THEOREM 11.1. The ratio of the distance from some arbitrary 
point M of a hyperbola to its focus and the distance from this 
point to the corresponding directrix is a constant which is equal 
to the eccentricity of the hyperbola. 


§12. The equation of a tangent line to a hyperbola. 


Let’s consider a hyperbola given by its canonical equation 

(10.20) in its canonical coordinate system (see Definition 10.4). 

Let’s draw a tangent line to this 

y hyperbola and denote through 

M = M(zo,yo) its tangency 

point (see Fig. 12.1). Our goal 

is to write the equation of a tan- 
gent line to a hyperbola. 

A hyperbola consists of two 
branches, each branch being a 
curve composed by two halves 
— the upper half and the lower 

Fig. 12.1 half. The upper halves of the hy- 
perbola branches can be treated 
as a graph of some function of the form 


y = f(z) (12.1) 


defined in the union of two intervals (—oo, —a) U (a, +00). Lower 
halves of a hyperbola can also be treated as a graph of some 
function of the form (12.1) with the same domain. The equation 
of a tangent line to the graph of the function (12.1) is given by 
the following well-known formula (see [9]): 


Y = yo + f’(xo) (x — z0). (12.2) 


In order to apply the formula (12.2) to a hyperbola we need 


182 CHAPTER II. GEOMETRY OF LINES AND SURFACES. 


to calculate the derivative of the function (12.1). Let’s substitute 
the function (12.1) into the equation (10.20): 


=1. (12.3) 


The equality (12.3) is fulfilled identically in x. Let’s differentiate 
the equality (12.3) with respect to x. This yields 


2x 2F(z) f"(2) 


s -en (12.4) 


Let’s apply (12.4) for to calculate the derivative f'(x): 


j B b x 
f(z) = Pha (12.5) 


In order to substitute (12.5) into the equation (12.2) we change 
x for zo and f(x) for f(xo) = yo. As a result we get 


(12.6) 


Let’s substitute (12.6) into the equation of the tangent line 
(12.2). This yields the following relationship 


b? To 
mre (x — zo). (12.7) 


Y = Yo + 
Eliminating the denominator, we write the equality (12.7) as 


a? y yo — b? £ £o = a? yp — b? xå. (12.8) 


Now let’s divide both sides of the equality (12.8) by a? b?: 


2 2 
T To Yyo _ To Yo 
awr qa 
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Note that the point M = M(2o,y9) is on the hyperbola. 
Therefore its coordinates satisfy the equation (10.20): 


2 2 
to Yo _ 
2 poh (12.10) 


Taking into account (12.10), we can transform (12.9) to 


LX yYyYo 
THEOREM 12.1. For a hyperbola determined by its canonical 
equation (10.20) its tangent line that touches this hyperbola at 
the point M = M(ao, yo) is given by the equation (12.11). 


The equation (12.11) is a particular case of the equation (3.22) 
where the constants A, B, and D are given by the formulas 


To Yo 
A= Baa D 


I 
= 


(12.12) 


According to the definition 3.6 and the formulas (3.21) the 
constants A and B in (12.12) are the covariant components of 
the normal vector for the tangent line to a hyperbola. The 
tangent line equation (12.11) is written in a canonical coordinate 
system of a hyperbola. The basis of such a coordinate system 
is orthonormal. Therefore the formula (3.19) and the formula 
(32.4) from Chapter I yield the following relationships: 


A=n=n', B=m =n. (12.13) 


THEOREM 12.2. The quantities A and B in (12.12) are the 
coordinates of the normal vector n for the tangent line to a hy- 
perbola which is given by the equation (12.11). 


The relationships (12.13) prove the theorem 12.2. 
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§13. Focal property of a hyperbola. 


Like in the case of an ellipse, assume that a hyperbola is 
manufactured of a thin strip of some flexible material and assume 
that its surface is covered with a light reflecting layer. For such 
a hyperbola we can formulate the following focal property. 


THEOREM 13.1. A light ray emitted in one focus of a hyper- 
bola upon reflecting on its surface goes to infinity so that its back- 
ward extension passes through the other focus of this hyperbola. 


THEOREM 13.2. The tangent line of a hyperbola is a bisector 
in the triangle composed by the tangency point and two foci of 
the hyperbola. 


The theorem 13.2 is a purely geometric version of the theo- 
rem 13.1. These theorems are equivalent due to the reflection 
law saying that the angle of re- 
flection is equal to the angle of 
incidence. 


y 


PROOF. Let’s consider some 
point M = M (zo, yo) on the hy- 
perbola and draw the tangent 
line to the hyperbola through 
this point as shown in Fig. 13.1. 
Let N be the x-intercept of this 
tangent line. Due to M and 
N, we have the segment [MN]. 
This segment is perpendicular to 
the normal vector n of the tangent line. In order to prove that 
[MN] is a bisector in the triangle F M F; it is sufficient to prove 
that n is directed along the bisector for the external angle of this 
triangle at its vertex M. This condition can be written as 


Fig. 13.1 


(FiıM,n) B (M F2,n) (13.1) 
IAM, — |MF| ` l 
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The coordinates of the points Fy, and F> in a canonical coor- 
dinate system of the hyperbola are known (see formulas (10.9)). 
The coordinates of the point M = M(z£o, yo) are also known. 
Therefore we can find the coordinates of the vectors FM and 
M Fy used in the above formula (13.1): 


To +c 
Yo 


FM = , MF; =| (13.2) 


—Y¥o 


The tangent line that touches the hyperbola at the point M = 
= M (20, yo) is given by the equation (12.11). The coordinates of 
the normal vector n of this tangent line in Fig. 13.1 are given by 
the formulas (12.12) and (12.13): 


n= (13.3) 


Relying upon (13.2) and (13.3), we apply the formula (33.3) from 
Chapter I in order to calculate the scalar products in (13.1): 


2 2 2 
——} CX +2 cx £ 
(FM, n) 0 o Yo 0 o Yo 


a? b2 a? a b’ 
rn? Czo— rô , YO _ CXo Yo ma 
MR n = SA oe 


The coordinates of the point M satisfy the equation (10.20): 


2 2 
ZT Yo _ 
2 er 1. (13.5) 


Due to (13.5) the formulas (13.4) simplify to 


2 

(F,M,n) = mys N 
j i , (13.6) 

(ie aso 7a" 


a? a 
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In order to calculate the denominators in the formula (13.1) 
we use the formulas (10.26) and (10.27). In this case, when 
applied to the point M = M (zo, yo), they yield 


c|xo| + sign(xo) a? 


|MF,| = — 
izol g (0) a? (13.7) 
|MF;| = c |zo| — sign (zo) a 
a 
Due to the purely numeric identity |ro| = sign(xo) zo we can 
write (13.7) in the following form: 
C£o + a: 
|M F| = ——— sign(zo), 
(13.8) 
|MFy| = 2" sign(z0). 


From the formulas (13.6) and (13.8) we easily derive the equalities 


(FiıM,n) _ sign(Zo) (M Fz,n) _ sign(zo) 
|M F| a , |M F | a 


that prove the equality (13.1). The theorem 13.2 is proved. 


As we noted above the theorem 13.1 is equivalent to the 
theorem 13.2 due to the light reflection law. Therefore the 
theorem 13.1 is also proved. 


§14. Asymptotes of a hyperbola. 


Asymptotes are usually understood as some straight lines to 
which some points of a given curve come unlimitedly close along 
some unlimitedly long fragments of this curve. Each hyperbola 
has two asymptotes (see Fig. 14.1). In a canonical coordinate 
system the asymptotes of the hyperbola given by the equation 
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(10.20) are determined by the 
following equations: 


b 
= ge 14.1 
z y z7 (14.1) 


One of the asymptotes is asso- 
ciated with the plus sign in the 
formula (14.1), the other asymp- 
tote is associated with the oppo- 
site minus sign. 

The theory of asymptotes is 
closely related to the theory of 
limits. This theory is usually studied within the course of 
mathematical analysis (see [9]). For this reason I do not derive 
the equations (14.1) in this book. 


Fig. 14.1 


$15. Parabola. Canonical equation of a parabola. 


y DEFINITION 15.1. A parabola 

M is a set of points on some plane 

JE each of which is equally distant 
from some fixed point F of this 
plane and from some straight 
line d lying on this plane. The 
point F is called the focus of 

x this parabola, while the line d is 
called its directriz. 


Assume that a parabola with 

the focus F and with the di- 

d rectrix d is given. Let’s drop 
the perpendicular from the point 

F onto the line d and denote 

through D the base of such a 

Fig. 15.1 perpendicular. Let’s choose the 
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line DF for the z-axis of a coordinate system. Then we denote 
through O the midpoint of the segment [DF] and choose this 
point O for the origin. And finally, we draw the y-axis of a 
coordinate system through the point O perpendicular to the line 
DF (see Fig. 15.1). Choosing the unity scales along the axes, we 
ultimately fix a coordinate system with an orthonormal basis on 
the parabola plane. 

Let’s denote through p the distance from the focus F of the 
parabola to its directrix d, i.e. we set 


|DF| =p. (15.1) 


The point O is the midpoint of the segment [DF]. Therefore the 
equality (15.1) leads to the following equalities: 


|DO| = |OF| =£. (15.2) 


NIV 


The relationships (15.2) determine the coordinates of D and F: 


Let M = M(x,y) be some arbitrary point of the parabola. 
According to the definition 15.1, the following equality is fulfilled: 


|IMF| =|MH| (15.4) 


(see Fig. 15.1). Due to (15.3) the length of the segment |M F] in 
the chosen coordinate system is given by the formula 


|MF| = yy? + (a — p/2)?). (15.5) 


The formula for the length of [MH] is even more simple: 


|MH| = z + p/2. (15.6) 
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Substituting (15.5) and (15.6) into (15.4), we get the equation 
P+ eB) = £ + 0/2. (15.7) 
Let’s square both sides of the equation (15.7): 
y? + (z — p/2)”) = (£ + p/2}. (15.8) 


Upon expanding brackets and collecting similar terms in (15.8), 
we bring this equation to the following form: 


y? =2pr. (15.9) 
DEFINITION 15.2. The equality (15.9) is called the canonical 


equation of a parabola. 


THEOREM 15.1. For each point M(x,y) of the parabola de- 
termined by the initial equation (15.7) its coordinates satisfy the 
canonical equation (15.9). 


Due to (15.1) the constant p in the equation (15.9) is a non- 
negative quantity. The case p = 0 corresponds to the degenerate 
parabola. From the definition 15.1 it is easy to derive that in 
this case the parabola turns to the straight line coinciding with 
the x-axis in Fig. 15.1. The case of the degenerate parabola is 
excluded by means of the inequality 


p>0. (15.10) 
Due to the inequality (15.10) from the equation (15.9) we derive 
2>0. (15.11) 


THEOREM 15.2. The canonical equation of the parabola (15.9) 
is equivalent to the initial equation (15.7). 


PROOF. In order to prove the theorem 15.2 it is sufficient 


CopyRight © Sharipov R.A., 2010. 


190 CHAPTER II. GEOMETRY OF LINES AND SURFACES. 


to invert the calculations performed in deriving the equality 
(15.9) from (15.7). Note that the passage from (15.8) to (15.9) is 
invertible. The passage from (15.7) to (15.8) is also invertible due 
to the inequality (15.11), which follows from the equation (15.9). 
This observation completes the proof of the theorem 15.2. 


DEFINITION 15.3. The point O in Fig. 15.1 is called the vertex 
of the parabola, the line DF coinciding with the x-axis is called 
the axis of the parabola. 


DEFINITION 15.4. A coordinate system O, e1, e2 with an or- 
thonormal basis e1, eg where a parabola is given by its canonical 
equation (15.9) and where the inequality (15.10) is fulfilled is 
called a canonical coordinate system of this parabola. 


§16. The eccentricity of a parabola. 


The definition of a parabola 15.1 is substantially different 
from the definition of an ellipse 6.1 and from the definition of a 
hyperbola 10.1. But it is similar to the property of directrices of 
an ellipse in the theorem 7.1 and to the property of directrices 
of a hyperbola in the theorem 11.1. Comparing the definition 
of a parabola 15.1 with these theorems, we can formulate the 
following definition. 


DEFINITION 16.1. The eccentricity of a parabola is postulated 
to be equal to the unity: € = 1. 


$817. The equation of a tangent line to a parabola. 


Let’s consider a parabola given by its canonical equation 
(15.9) in its canonical coordinate system (see Definition 15.4). 
Let’s draw a tangent line to this parabola and denote through 
M = M(ao, yo) the tangency point (see Fig. 17.1). Our goal is 
to write the equation of the tangent line to the parabola through 
the point M = M (zo, yo). 

An parabola is a curve composed by two halves — the up- 
per half and the lower half. Any one of these two halves of 
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a parabola can be treated as a 
graph of some function 


y = f(z) (17.1) 


with the domain (0, +00). The 
equation of a tangent line to 
the graph of a function (17.1) is 
given by the well-known formula 


Yy- Yo = 


= f' (ao) (z — z0). whe 


Fig. 17.1 
(see [9]). In order to apply the 
formula (17.2) to a parabola one should calculate the derivative 
of the function (17.1). Let’s substitute (17.1) into the equation 
(15.9): 
(Fo)? = 2px. (17.3) 
The equality (17.3) is fulfilled identically in x. Let’s differentiate 
the equality (17.3) with respect to x. This yields 


2 f(x) f(x) = 2p. (17.4) 
Let’s apply the formula (17.4) for to calculate the derivative 


i (17.5) 


In order to substitute (17.5) into the equation (17.2) we change 
x for zo and f(x) for f(xo) = yo. As a result we get 


f'(xo) = A (17.6) 


Let’s substitute (17.6) into the equation of the tangent line 
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(17.2). This yields the following relationship: 


Y — Yo = = (x — xo). (17.7) 
Yo 


Eliminating the denominator, we write the equality (17.7) as 


Y Yo — Yo = PT — p xo. (17.8) 


Note that the point M = M (zo, yo) is on the parabola. Therefore 
its coordinates satisfy the equality (15.9): 


yé = 2p zo. (17.9) 
Taking into account (17.9), we can transform (17.8) to 
Y Yo = pT + p To. (17.10) 


This is the required equation of a tangent line to a parabola. 


THEOREM 17.1. For a parabola determined by its canonical 
equation (15.9) the tangent line that touches this parabola at the 
point M = M (zo,yo) is given by the equation (17.10). 


Let’s write the equation of a tangent line to a parabola in the 
following slightly transformed form: 


p£ — yy + px = 0. (17.11) 


The equation (17.11) is a special instance of the equation (3.22) 
where the constants A, B, and D are given by the formulas 


A=p, B=-y0, D = pxo. (17.12) 
According to the definition 3.6 and the formulas (3.21), the 


constants A and B in (17.12) are the covariant components 
of the normal vector of a tangent line to a parabola. The 
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equation (17.11) is written in a canonical coordinate system of 
the parabola. The basis of a canonical system is orthonormal 
(see Definition 15.4). In the case of an orthonormal basis the 
formula (3.19) and the formula (32.4) from Chapter I yield 


A=n =r}, B=m =n. (17.13) 


THEOREM 17.2. The quantities A and B in (17.12) are the 
coordinates of the normal vector n of a tangent line to a parabola 
in the case where this tangent line is given by the equation (17.10). 


The relationships (17.13) prove the theorem 17.2. 


$18. Focal property of a parabola. 


Assume that we have a parabola manufactured of a thin strip 
of some flexible material covered with a light reflecting layer. For 
such a parabola the following focal property is formulated. 


THEOREM 18.1. A light ray emitted from the focus of a para- 
bola upon reflecting on its surface goes to infinity parallel to the 
axis of this parabola. 


THEOREM 18.2. For any tan- 

y gent line of a parabola the trian- 

M gle formed by the tangency point 

H M, its focus F, and by the point 

N at which this tangent line in- 

tersects the axis of the parabola 

d is an isosceles triangle, i.e. the 
equality |M F| = |N F| holds. 


x 
As we see in Fig. 18.1, the 
D N O F theorem 18.1 follows from the 
theorem 18.2 due to the reflec- 
Fig. 18.1 tion law saying that the angle of 


reflection is equal to the angle of 
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incidence and due to the equality of inner crosswise lying angles 
in the intersection of two parallel lines by a third line (see [6]). 


PROOF OF THE THEOREM 18.2. For to prove this theorem we 
choose some tangency point M (zo, yo) and write the equation of 
the tangent line to the parabola in the form of (17.10): 


YYo = pT + p £o. (18.1) 


Let’s find the intersection point of the tangent line (18.1) with the 
axis of the parabola. In a canonical coordinate system the axis 
of the parabola coincides with the x-axis (see Definition 15.3). 
Substituting y = 0 into the equation (18.1), we get « = —Zo, 
which determines the coordinates of the point N: 


N = N(—2o,0). (18.2) 
From (18.2) and (15.3) we derive the length of the segment |N F]: 
INF] = p/2 — (—z0) = p/2 + zo. (18.3) 


In the case of a parabola the length of the segment [MF] coin- 
cides with the length of the segment [MH] (see Definition 15.1). 
Therefore from (15.3) we derive 


|MF| =|MB| = zo — (—p/2) = £o + p/2. (18.4) 


Comparing (18.3) and (18.4) we get the required equality |M F| = 
|NF|. As a result the theorem 18.2 is proved. As for the 
theorem 18.1, it equivalent to the theorem 18.2. 


§19. The scale of eccentricities. 


The eccentricity of an ellipse is determine by the formula (7.1), 
where the parameters c and a are related by the inequalities (6.4). 
Hence the eccentricity of an ellipse obeys the inequalities 


0<e<1. (19.1) 
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The eccentricity of a parabola is equal to unity by definition. 
Indeed, the definition 16.1 yields 


e=l. (19.2) 


The eccentricity of a hyperbola is defined by the formula 
(11.1), where the parameters c and a obey the inequalities (10.8). 
Hence the eccentricity of a hyperbola obeys the inequalities 


1<e< too: (19.3) 


The formulas (19.1), (19.2), and (19.3) show that the eccen- 
tricities of ellipses, parabolas, and hyperbolas fill the interval 
from 0 to +oo without omissions, i.e. we have the continuous 
scale of eccentricities. 


§ 20. Changing a coordinate system. 


Let O, e1, e2, e3 and O, &1, 9, é3 be two Cartesian coordi- 
nate systems in the space E. They consist of the bases e1, e2, e3 
and €1, 9, č complemented with two points O and O, which 
are called origins (see Definition 1.1). The transition from the 
basis e1, €2, e3 to the basis €;, €2, č and vice versa is described 
by two transition matrices S and T whose components are in the 
following transition formulas: 


3 3 
a =D Sie =e æ) 
i=] i=l 


(see formulas (22.4) and (22.9) in Chapter I). 
In order to describe the transition from the coordinate system 


O, e1, €2, e3 to the coordinate system O, €1, č2, č and vice 
versa two auxiliary parameters are employed. These are the 
—> —> 


vectors a= OO and ã = ÕO . 
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=F = 
DEFINITION 20.1. The vectors a = OO and a = OO are 
called the origin displacement vectors. 


=a 
The origin displacement vector a = OO is usually expanded 
in the basis e1, e2, e3, while the other origin displacement vector 


& = OO is expanded in the basis 6, č2, é3: 


3 3 
a= de; a= 5 ü ë. (20.2) 
i=l i 


— = 
The vectors a= OO are a= OO opposite to each other, i.e. the 
following relationships are fulfilled: 


a = —ã, a=-—a. (20.3) 


Their coordinates in the expansions (20.2) are related with each 
other by means of the formulas 


3 3 
ä=-Y Tiai, d =—S > Stal. (20.4) 
j=l j=l 


The formulas (20.4) are derived from the formulas (20.3) with 
the use of the formulas (25.4) and (25.5) from Chapter I. 


§21. Transformation of the coordinates of a 
point under a change of a coordinate system. 


Let O, e1, e2, e3 and O, č, 2, č be two Cartesian coordi- 
nate systems in the space E and let X be some arbitrary point in 
the space E. Let’s denote through 


X = X(x, 2°, x’), X = X(ž', 2,23) (21.1) 
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the presentations of the point X in these two coordinate systems. 
The radius vectors of the point X in these systems are related 
with each other by means of the relationships 


ry =a+ry, rx =at+rx, (21.2) 


The coordinates of X in (21.1) are the coordinates of its radius 
vectors in the bases of the corresponding coordinate systems: 


3 
re =) e; Fx =) e (21.3) 
j=l j=l 
From (21.2), (21.3), and (20.2), applying the formulas (25.4) and 
(25.5) from Chapter I, we easily derive the following relationships: 


3 
fa ea, = Bo a (21.4) 
j=l j=l 
THEOREM 21.1. Under a change of coordinate systems in the 
space E determined by the formulas (20.1) and (20.2) the coordi- 
nates of points are transformed according to the formulas (21.4). 


The formulas (21.4) are called the direct and inverse transfor- 
mation formulas for the coordinates of a point under a change of 
a Cartesian coordinate system. 


§ 22. Rotation of a rectangular coordinate 
system on a plane. The rotation matrix. 


Let O, e1, e2 and O, č, č be two Cartesian coordinate sys- 
tem on a plane. The formulas (20.1), (20.2), and (20.4) in this 
case are written as follows: 


2 


2 
6 = X Sie, g=] Te; (22.1) 
{=l 


1=1 
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2 
a=) de; a= So a'é, (22.2) 


i=1 i=l 
2 2 
a@=-S°Ti a’, a’ =—S° Sia, (22.3) 
j=l j=l 


Let X be some arbitrary point of the plane. Its coordinates in 
the coordinate systems O, e1, eg and O, č, €2 are transformed 
according to the formulas similar to (21.4): 


2 2 
= Yr ai +a’, gi = Do ZÍ +a’. (22.4) 
j=1 


j=1 


Assume that the origins O and O do coincide. In this case 
the parameters a!, a? and a’, &? in the formulas (22.2), (22.3), 
(22.3), and (22.4) do vanish. Un- 
der the assumption O = O we 
consider the special case where 
the bases e;, eg and &1, č both 
are orthonormal and where one 
of them is produced from the 
other by means of the rotation 
by some angle ọ (see Fig. 22.1). 

Fig. 22.1 For two bases on a plane the 

transition matrices S and T are 

square matrices 2 x 2. The components of the direct transition 
matrix S are taken from the following formulas: 


č =cosy-e, + sin: eg, 
re : (22.5) 
e2 = — SIN Y : €1 + COS Y : eg. 


The formulas (22.5) are derived on the base of Fig. 22.1. 
Comparing the formulas (22.5) with the first relationship in 
(22.1), we get S1 = cosy, S? = sing, S? = — sin ọ, S2 = cosy. 
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Hence we have the following formula for the matrix S: 


cosy -sino (22.6) 


siny cosy 

DEFINITION 22.1. The square matrix of the form (22.6) is 
called the rotation matrix by the angle ọ. 

The inverse transition matrix T is the inverse matrix for S 
(see theorem 23.1 in Chapter I). Due to this fact and due to the 
formula (22.6) we can calculate the matrix T: 
cos(—y) —sin(—y) 
sin(—y) — cos(—y) 


T= 


(22.7) 


The matrix (22.7) is also a rotation matrix by the angle y. But 
the angle ọ in it is taken with the minus sign, which means that 
the rotation is performed in the opposite direction. 

Let’s write the relationships (22.4) taking into account that 


a = 0 and a = 0, which follows from O = O, and taking into 
account the formulas (22.6) and (22.7): 


1 


&' = cos(y) x! + sin(y) 2, (228) 
Z? = —sin(y) x’ + cos(y) x, 

1 ~1 wd 
x` = cos(p) č — sin(p) z*, 

, (p) (2) (22.9) 


The formulas (22.8) and (22.9) are the transformation formulas 
for the coordinates of a point under the rotation of the rectangu- 
lar coordinate system shown in Fig. 22.1. 


§ 23. Curves of the second order. 


DEFINITION 23.1. A curve of the second order or a quadric 
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on a plane is a curve which is given by a polynomial equation of 
the second order in some Cartesian coordinate system 


Az’? +2Bry+Cy?+2D2+2Ey+F=0. (23.1) 


Here x = x! and y = z? are the coordinates of a point on a 


plane. Note that the transformation of these coordinates under 
a change of one coordinate system for another is given by the 
functions of the first order in zt and x? (see formulas (22.4)). For 
this reason the general form of the equation of a quadric (23.1) 
does not change under a change of a coordinate system though 
the values of the parameters A, B, C, D, E, and F in it can 
change. From what was said we derive the following theorem. 


THEOREM 23.1. For any curve of the second order on a plane, 
i.e. for any quadric, there is some rectangular coordinate system 
with an orthonormal basis such that this curve is given by an 
equation of the form (23.1) in this coordinate system. 


§ 24. Classification of curves of the second order. 


Let I’ be a curve of the second order on a plane given by 
an equation of the form (23.1) in some rectangular coordinate 
system with the orthonormal basis. Passing from one of such 
coordinate systems to another, one can change the constant 
parameters A, B, C, D, E, and F in (23.1), and one can always 
choose a coordinate system in which the equation (23.1) takes its 
most simple form. 


DEFINITION 24.1. The problem of finding a rectangular coor- 
dinate system with the orthonormal basis in which the equation 
of a curve of the second order I takes its most simple form is 
called the problem of bringing the equation of a curve IT to its 
canonical form. 


An ellipse, a hyperbola, and parabola are examples of curves 
of the second order on a plane. The canonical forms of the 
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equation (23.1) for these curves are already known to us (see 
formulas (6.15), (10.20), and (15.9)). 


DEFINITION 24.2. The problem of grouping curves by the 
form of their canonical equations is called the problem of classi- 
fication of curves of the second order. 


THEOREM 24.1. For any curve of the second order I there is 
a rectangular coordinate system with an orthonormal basis where 
the constant parameter B of the equation (23.1) for T is equal to 
zero: B=0. 


PROOF. Let O, e1, e2 be rectangular coordinate system with 
an orthonormal basis e;, e2 where the equation of the curve T 
has the form (23.1) (see Theorem 23.1). If B = 0 in (23.1), then 
O, e1, €2 is a required coordinate system. 

If B # 0, then we perform the rotation of the coordinate 
system O, e1, €2 about the point O by some angle y. Such a 
rotation is equivalent to the change of variables 


R 
l 
n 
E 
& 


z = cos(y) 
y =sin(y) č + cos(y) (24.1) 


in the equation (23.1) (see formulas (22.9)). Upon substituting 
(24.1) into the equation (23.1) we get the analogous equation 


A#+2BE9+CHP+2DFZ+2EG+F=0 (24.2) 
whose parameters A, B, C, D 
the parameters A, B, C, 
(23.1). For the parameter 


, D, E, and F are expressed through 
D, E, and F of the initial equation 
B in (24.2) we derive the formula 

B = (C — A) cos(y) sin(y) + B cos?(y) — 

(24.3) 


A 
— B sin?(y) = 2 5 sin(2 p) + B cos(2y). 
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Since B 4 0, we determine ọ as a solution of the equation 


A-C 
2”) = : 24.4 
ctg29) = Z (24.4) 
The equation (24.4) is always solvable in the form of 
m 1 A-C 
=--> — }. 24. 
o=37 3r ap ) eo 


Comparing (24.4) and (24.3), we see that having determined the 
angle y by means of the formula (24.5), we provide the vanishing 
of the parameter B = 0 in the rotated coordinate system. The 
theorem 24.1 is proved. 


Let’s apply the theorem 24.1 and write the equation of the 
curve I in the form of the equation (23.1) with B = 0: 


Av? +Cy+2D2e+2by+F =0. (24.6) 


The equation (24.6) provides the subdivision of all curves of the 
second order on a plane into three types: 


— the elliptic type where A 4 0, C #0 and the parameters A 
and C are of the same sign, i.e. sign(A) = sign(C); 

— the hyperbolic type where A 4 0, C # 0 and the parame- 
ters A and C are of different signs, i.e. sign(A) Æ sign(C); 

— the parabolic type where A = 0 or C = 0. 


The parameters A and C in (24.6) cannot vanish simultaneously 
since in this case the degree of the polynomial in (24.6) would be 
lower than two, which would contradict the definition 23.1. 

Curves of the elliptic type. If the conditions A 4 0, C £0, 
and sign(A) = sign(C) are fulfilled, without loss of generality we 
can assume that A > 0 and C > 0. In this case the equation 
(24.6) can be written as follows: 


oe A =0. (24.7) 


aa a) T 
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Let’s perform the following change of variables in (24.7): 


D E 

=- =ğ- 4, 24.8 
r=ġ- 7 v= (24.8) 
The change of variables (24.8) corresponds to the displacement of 
the origin without rotation (the case of the unit matrices S = 1 


and T = 1 in (22.4)). In addition to (24.8) we denote 


~ D? FP? 
F = F — — — —. 24.9 
y T (24.9) 
Taking into account (24.8) and (24.9), we write (24.7) as 
AZ? +08? +F =0, (24.10) 


where the coefficients A and C are positive: A > 0 and C > 0. 
The equation (24.10) provides the subdivision of curves of the 
elliptic type into three subtypes: 


— the case of an ellipse where F < 0; 
— the case of an imaginary ellipse where F > 0; 
— the case of a point where F = 0. 

In the case of an ellipse the equation (24.10) is brought to the 
form (6.15) in the variables and y. As we know, this equation 
describes an ellipse. 

In the case of an imaginary ellipse the equation (24.10) reduces 
to the equation which is similar to the equation of an ellipse: 


he (24.11) 


The equation (24.11) has no solutions. Such an equation de- 
scribes the empty set of points. 

The case of a point is sometimes called the case of a pair of 
imaginary intersecting lines, which is somewhat not exact. In 
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this case the equation (24.10) describes a single point with the 
coordinates = 0 and y = 0. 

Curves of the hyperbolic type. If the conditions A Æ 0, 
C # 0, and sign(A) ¥ sign(C) are fulfilled, without loss of 
generality we can assume that A > 0 and C < 0. In this case the 
equation (24.6) can be written as 


A(z+2)’-G(y-3)'+(r-74+4)=0, (24.12) 


where —C = C > 0. Let’s denote 
F=F-—4+—> (24.13) 


and then perform the following change of variables, which corre- 
sponds to a displacement of the origin: 


a y=9+5. (24.14) 


Due to (24.13) and (24.14) the equation (24.12) is written as 
AZ -8 +F =0, (24.15) 


where the coefficients A and C are positive: A > 0 and C > 0. 
The equation (24.15) provides the subdivision of curves of the 
hyperbolic type into two subtypes: 


— the case of a hyperbola where F Æ 0; g 
— the case of a pair of intersecting lines where F = 0. 


In the case of a hyperbola the equation (24.15) is brought 
to the form (10.20) in the variables and y. As we know, it 
describes a hyperbola. 

In the case of a pair of intersecting lines the left hand side of 
the equation (24.15) is written as a product of two multiplicands 
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and the equation (24.15) is brought to 


(VAZ + VCG)(VAz - VCH) =0. (24.16) 


The equation (24.16) describes two line on a plane that intersect 
at the point with the coordinates z = 0 andy = 0. 

Curves of the parabolic type. For curves of this type 
there are two options in the equation (24.6): A = 0, C #0 or 
C=0, A #0. But the second option reduces to the first one 
upon changing variables x = —y, y = z, which corresponds to 
the rotation by the angle y = 7/2. Therefore without loss of 
generality we can assume that A = 0 and C #0. Then the 
equation (24.6) is brought to 


yY +2Dxe+2Ey+F=0. (24.17) 


In order to transform the equation (24.17) we apply the change 
of variables y = y — E, which corresponds to the displacement of 
the origin along the y-axis. Then we denote F = F + E?. Asa 
result the equation (24.17) is written as 


7 +2D2+F=0. (24.18) 


The equation (24.18) provides the subdivision of curves of the 
parabolic type into four subtypes: 


— the case of a parabola where D F 0; 

— the case of a pair of parallel lines 
where D = 0 and F <0; 

— the case of a pair of coinciding lines 
where D = 0 and F = 0; 

— the case of a pair of imaginary parallel lines 
where D = 0 and F > 0. 


In the case of a parabola the equation (24.18) reduces to the 
equation (15.9) and describes a parabola. 
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_ In the case of a pair of parallel lines we introduce the notation 
F = —y? into the equation (24.18). As a result the equation 
(24.18) is written in the following form: 


(J + yo) (J — yo) = 0. (24.19) 


The equation (24.19) describes a pair of lines parallel to the 
y-axis and being at the distance 2 yo from each other. 

In the case of a pair of coinciding lines the equation (24.18) re- 
duces to the form J? = 0, which describes a single line coinciding 
with the y-axis. 

In the case of a pair of imaginary parallel lines the equation 
(24.18) has no solutions. It describes the empty set. 


§ 25. Surfaces of the second order. 
DEFINITION 25.1. A surface of the second order or a quadric 


in the space E is a surface which in some Cartesian coordinate 
system is given by a polynomial equation of the second order: 


Av’? +2Bry+Cy +2Drz+2Eyz+ 


25.1 
+F +2Gr+2Hy+2Iz+J=0. Pe 


Here x = xt, y = x”, z = x? are the coordinates of points 


of the space E. Note that the transformation of the coordinates 
of points under a change of a coordinate system is given by 
functions of the first order in x', x”, x? (see formulas (21.4)). For 
this reason the general form of a quadric equation (25.1) remains 
unchanged under a change of a coordinate system, though the 
values of the parameters A, B, C, D, E, F, G, H, I, and J can 
change. The following theorem is immediate from what was said. 


THEOREM 25.1. For any surface of the second order in the 
space E, i.e. for any quadric, there is some rectangular coordinate 
system with an orthonormal basis such that this surface is given 
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by an equation of the form (25.1) in this coordinate system. 


§ 26. Classification of surfaces of the second order. 


The problem of classification of surfaces of the second order 
in E is solved below following the scheme explained in §2 of 
Chapter VI in the book [1]. Let S be surface of the second order 
given by the equation (25.1) on some rectangular coordinate 
system with an orthonormal basis (see Theorem 25.1). Let’s 
arrange the parameters A, B, C, D, E, F, G, H, I of the 
equation (25.1) into two matrices 


A BD G 
F=||B C EH, D=|H|. (26.1) 
D EF I 


The matrices (26.1) are used in the following theorem. 


THEOREM 26.1. For any surface of the second order S there is 
a rectangular coordinate system with an orthonormal basis such 
that the matrix F in (26.1) is diagonal, while the matrix D is 
related to the matrix F by means of the formula F -D = 0. 


The proof of the theorem 26.1 can be found in [1]. Applying 
the theorem 26.1, we can write the equation (25.1) as 


Ar?’ + Cy’ +F? +2Gr+2Hy+2Iz+J=0. (26.2) 


The equation (26.2) and the theorem 26.1 provide the subdivision 
of all surfaces of the second order in E into four types: 


— the elliptic type where A 4 0, C # 0, F Æ 0 and the 
quantities A, C, and F are of the same sign; 

— the hyperbolic type where A 4 0, C40, F Æ 0 and the 
quantities A, C and F are of different signs; 

— the parabolic type where exactly one of the quantities A, 
C. and F is equal to zero and exactly one of the quantities 
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G, H, and I is nonzero. 
— the cylindrical type in all other cases. 

Surfaces of the elliptic type. From the conditions A Æ 0, 
C #0, F £0 in (26.2) and from the condition F -D = 0 in 
the theorem 26.1 we derive G = 0, H = 0, and J = 0. Since 
the quantities A, C, and F are of the same sign, without loss of 
generality we can assume that all of them are positive. Hence for 
all surfaces of the elliptic type we can write (26.2) as 


A? +O +Fz?+J=0, (26.3) 


where A> 0, C > 0, and F > 0. The equation (26.3) provides 
the subdivision of surfaces of the elliptic type into three subtypes: 


— the case of an ellipsoid where J < 0; 
— the case of an imaginary ellipsoid where J > 0; 
— the case of a point where J = 0. 


The case of an ellipsoid is the most non-trivial of the three. In 
this case the equation (26.3) is brought to 


So mi (26.4) 


The equation (26.4) describes the surface which is called an 


Z z 


Fig. 26.1 Fig. 26.2 
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ellipsoid. This surface is shown in Fig. 26.1. 

In the case of an imaginary ellipsoid the equation (26.3) has 
no solutions. It describes the empty set. 

In the case of a point the equation (26.3) can be written in 
the form very similar to the equation of an ellipsoid (26.4): 


2 2 z2 


+% =0. (26.5) 


9 


The equation (26.5) describes a single point in the space E with 
the coordinates « = 0, y = 0, z = 0. 

Surfaces of the hyperbolic type. From the three conditions 
A#0,C #40, F £0 in (26.2) and from the condition F -D = 0 
in the theorem 26.1 we derive G = 0, H = 0, and J = 0. The 
quantities A, C, and F are of different signs. Without loss of 
generality we can assume that two of them are positive and one 
of them is negative. By exchanging axes, which preserves the 
orthogonality of coordinate systems and orthonormality of their 
bases, we can transform the equation (26.2) so that we would 
have A>0, C>0, and F <0. As a result we conclude that for 
all surfaces of the hyperbolic type the initial equation (26.2) can 
be brought to the form 


Ae? +Cy+F2+J=0, (26.6) 


where A > 0, C >0, F < 0. The equation (26.6) provides the 
subdivision of surfaces of the hyperbolic type into three subtypes: 


— the case of a hyperboloid of one sheet where J < 0; 
— the case of a hyperboloid of two sheets where J > 0; 
— the case of a cone where J = 0. 


In the case of a hyperboloid of one sheet the equation (26.6) 
can be written in the following form: 


a ees (26.7) 
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The equation (26.7) describes a surface which is called the hyper- 
boloid of one sheet. It is shown in Fig. 26.2. 
In the case of a hyperboloid of two sheets the equation (26.6) 
can be written in the following form: 
n ed (26.8) 
a bB e ` l 
The equation (26.8) describes a surface which is called the 
hyperboloid of two sheets. This surface is shown in Fig. 26.3. 


z zZ 


Fig. 26.3 Fig. 26.4 


In the case of a cone the equation (26.6) is transformed to the 
equation which is very similar to the equations (26.7) and (26.8), 
but with zero in the right hand side: 


lee Ape ci (26.9) 


The equation (26.9) describes a surface which is called the cone. 
This surface is shown in Fig. 26.4. 

Surfaces of the parabolic type. For this type of surfaces 
exactly one of the three quantities A, C, and F is equal to 
zero. By exchanging axes, which preserves the orthogonality of 
coordinate systems and orthonormality of their bases, we can 
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transform the equation (26.2) so that we would have A Æ 0, 
C #0, and F=0. From A #0, C #0, and from the condition 
F-D = 0 in the theorem 26.1 we derive G = 0 and H = 0. 
The value of J is not determined by the condition F -D = 0. 
However, according to the definition of surfaces of the parabolic 
type exactly one of the three quantities G, H, I should be 
nonzero. Due to G = 0 and H = 0 we conclude that J 40. Asa 
result the equation (26.2) is written as 


Ag’? +Cy?+2Iz2+J=0, (26.10) 


where A #0, C #0, and I #0. The condition J # 0 means that 
we can perform the displacement of the origin along the z-axis 
equivalent to the change of variables 


J 
— —. 26.11 
z>z aT ( ) 


Upon applying the change of variables (26.11) to the equation 
(26.10) this equation is written as 


Az’? +Cy?+2Iz=0, (26.12) 


where A#0, C #0, 140. The equation (26.12) provides the 
subdivision of surfaces of the parabolic type into two subtypes: 


— the case of an elliptic paraboloid where the quantities 
A#0 and C £0 are of the same sign; 

— the case of a hyperbolic paraboloid, where the quantities 
A#0and C ¥ 0 are of different signs. 


In the case of an elliptic paraboloid the equation (26.12) can 
be written in the following form: 


at eae (26.13) 


The equation (26.13) describes a surface which is called the 
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Fig. 26.5 Fig. 26.6 


elliptic paraboloid. This surface is shown in Fig. 26.5. 
In the case of a hyperbolic paraboloid the equation (26.12) can 
be transformed to the following form: 


a (26.14) 


The equation (26.14) describes a saddle surface which is called 
the hyperbolic paraboloid. This surface is shown in Fig. 26.6. 

Surfaces of the cylindrical type. According to the results 
from §2 in Chapter VI of the book [1], in the cylindrical case 
the dimension reduction occurs. This means that there is a 
rectangular coordinate system with an orthonormal basis where 
the variable z drops from the equation (26.2): 


Az’?+Cy?+2Gzr+2Hy+J=0. (26.15) 


The classification of surfaces of the second order described by the 
equation (26.15) is equivalent to the classification of curves of the 
second order on a plane described by the equation (24.6). The 
complete type list of such surfaces contains nine cases: 


— the case of an elliptic cylinder; 


§26. CLASSIFICATION OF SURFACES ... 213 


— the case of an imaginary elliptic cylinder; 
— the case of a straight line; 


zZ 


Fig. 26.7 


Fig. 26.8 


the case of a hyperbolic cylinder; 

the case of a pair of intersecting planes; 

the case of a parabolic cylinder; 

the case of a pair of parallel planes; 

the case of a pair of coinciding planes; 

the case of a pair of imaginary parallel planes. 


In the case of an elliptic cylinder the equation (26.15) can be 
transformed to the following form: 


Fig. 26.9 


A (26.16) 


The equation (26.16) coincides 
with the equation of an ellipse 
on a plane (6.15). In the space 
E it describes a surface which is 
called the elliptic cylinder. This 
surface is shown in Fig. 26.7. 

In the case of an imaginary 
elliptic cylinder the equation 
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(26.15) can be transformed to the following form: 


Sa dee ae, (26.17) 


The equation (26.17) has no solutions. Such an equation de- 
scribes the empty set. 

In the case of a straight line the equation (26.15) can be 
brought to the form similar to (26.16) and (26.17): 


Beet (26.18) 


The equation (26.18) describes a straight line in the space coin- 
ciding with the z-axis. In the canonical form this line is given by 
the equations x = 0 and y = 0 (see (5.14)). 

In the case of a hyperbolic cylinder the equation (26.15) can 
be transformed to the following form: 


i a (26.19) 


The equation (26.19) coincides with the equation of a hyperbola 
on a plane (6.15). In the spatial case it describes a surface which 
is called the hyperbolic cylinder. It is shown in Fig. 26.8. 

The next case in the list is the case of a pair of intersecting 
planes. In this case the equation (26.15) can be brought to 


es (26.20) 


The equation (26.20) describes the union of two intersecting 
planes in the space given by the equations 


uv yg x 
= >=) = = 0. 26.21 


ols 
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The planes (26.21) intersect along a line which coincides with the 
z-axis. This line is given by the equations x = 0 and y = 0. 

In the case of a parabolic cylinder the equation (26.15) reduces 
to the equation coinciding with the equation of a parabola 


y? =2pz. (26.22) 


In the space the equation (26.22) describes a surface which is 
called the parabolic cylinder. This surface is shown in Fig. 26.9. 

In the case of a pair of parallel planes the equation (26.15) is 
brought to y? — yê = 0, where yo Æ 0. It describes two parallel 
planes given by the equations 


Y = Yo, y = —Yo- (26.23) 


In the case of a pair of coinciding planes the equation (26.15) 
is also brought to y? — y = 0, but the parameter yo in it is equal 
to zero. Due to yo = 0 two planes (26.23) are glued into a single 
plane which is perpendicular to the y-axis and is given by the 
equation y = 0. 

In the case of a pair of imaginary parallel planes the equation 
(26.15) is brought to y? +y = 0, where yo Æ 0. Such an equation 
has no solutions. For this reason it describes the empty set. 
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